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ABSTRACT 



Aims. We study the mean number counts and two-point correlation functions, along with their covariance matrices, of 
cosmological surveys such as for clusters. In particular, we consider correlation functions averaged over finite redshift 
intervals, which are well suited to cluster surveys or populations of rare objects, where one needs to integrate over 
nonzero redshift bins to accumulate enough statistics. 

Methods. We develop an analytical formalism to obtain explicit expressions of all contributions to these means and 
covariance matrices, taking into account both shot-noise and sample-variance effects. We compute low-order as well as 
high-order (including non-Gaussian) terms. 

Results. We derive expressions for the number counts per redshift bins both for the general case and for the small 
window approximation. We estimate the range of validity of Limber's approximation and the amount of correlation 
between different redshift bins. We also obtain explicit expressions for the integrated 3D correlation function and the 
2D angular correlation. We compare the relative importance of shot-noise and sample-variance contributions, and of 
low-order and high-order terms. We check the validity of our analytical results through a comparison with the Horizon 
full-sky numerical simulations, and we obtain forecasts for several future cluster surveys. 

Key words. Surveys - Galaxies: clusters: general - Cosmology: large-scale structure of Universe - Cosmology: observations 



1. Introduction 

The large-scale structure of the Universe is a key test 
of modern cosmological scenarios. Indeed, according to 
the standard cosmological model, the large-scale struc- 
tures of the present Universe have formed through 
the amplification by gravitational insta bility of sma ll 
almost-Gaussian primordial fluctuations (|Peeblesl 119801) . 
Then, from obse rvations of the recent Universe, such as 
galaxy surveys (ICole et al.l 



Pacaud et al 
2007t 



2005; Tcgmark et al 



cluster survey s (lEvrardlll989t lOukbir fe Blanchardl Il992t 



2001 



j liniyrarui iiaoat iuukuu <y, jp iancnaiu uat>zt p, 
20071), wea k-lensing studies (jMassev et al.l |j 



Muns hi et al. 20081). or measures of baryon acous- 



tic oscillations ( Eisenstein et al.l 119981 [2005) , one can de- 
rive constraints on the cosmological parameters (e.g., the 
mean matter and dark energy contents) and on the prop- 
erties of the initial perturbations (e.g., possible deviations 
from Gaussianity) . Moreover, one can check whether these 
structures have really formed through this gravitational in- 
stability process. 

In the case of well-defined astrophysical objects, such 
as galaxies or X-ray clusters, which form a discrete popula- 
tion, standard probes are the abundance of these objects, 
that is, "number counts" , and their low-order correlation 
functions. Galaxies are governed by complex gas physics 
and star formation processes, which makes it difficult to re- 
late their abundance as a function of optical luminosity to 
theoretical predictions. However, since they are rather com- 
mon objects (one can reach a l most 1 6 galaxies in current 
surveys, e.g. lAbazaiian et al.1 (|2009l )) it is possible to re- 



construct halo density fields using subsamples of luminous 
red galaxies and to compare their power spectrum (i.e. the 
Fourier transform of their two-point cor relation) with the - 
ory to derive constraints on cosmology (|Reid et al.| [2010). 
In contrast, as the largest nonlinear objects in the present 
Universe, galaxy clusters are much rarer (current cluster 
samples have a den sity of about ten per deg 2 at most, e.g. 
|Ad ami et aD (|201ll )). but their relationship with dark mat- 
ter halo mass is controlled better. This means that their 
abundance is very se nsitive to cosmo l ogical parameters (es- 
pecial ly Cl m and as, lEvrar d| (11989D : lOukbir fc Blanchardl 
(1992])), and has already been used to derive constraints 
on cosmology, but their clustering has not yet provided 
much cosmological information (because of low statistics). 
However, upcoming cluster surveys should allow the use of 
both number count s and spatial cluste ring to derive cosmo- 
logical constraints (jPierre et al.ll20lH ). 

To compare observations with theory, one needs either 
to relate the observed properties of the objects (e.g., optical 
galaxy luminosity, X-ray cluster luminosity or temperature) 
to the quantities that are predicted by theoretical models 
(e.g., virialized halo mass) or to use se mi-analytical model s 
that attempt to build mock catalogs (|Harker et al.l [2007h . 
For clusters, one can use scaling laws between luminosity, 
temperature, a nd mass, calibrated on o bservations of the 
local universe (lArnaud et al.l 120051 120101 ). Then, one must 
take the selection function of the survey being considered 
into account, since the probability of detecting the objects 
is usually more complex than a sharp cutoff on mass or 
luminosity (| Pierre et all 120111 ). Finally, one needs to esti- 
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mate the error bars of the statistical quantities that are 
measured, in order to derive meaningful constraints on cos- 
mology. In addition to the uncertainties associated with the 
relationship between observed quantities (e.g., luminosity) 
and theoretical quantities (e.g., halo mass) discussed above, 
two unavoidable sources of uncertainty are the "shot-noise" 
effects of the discrete character of the population and the 
"sample variance" due to the limited size of the survey. In 
the case of a full-sky survey, the latter is also known as 
the "cosmic variance" , due to the fact that we only observe 
"one sky" so that there is only a limited number of low-A: 
modes to be measured. 

In practice dBenoist et al.l 119961: iMaller et all 120051: 
iNorberg et aTll2009H . one often estimates error bars from 
the data itself by subsampling the data and by computing 
the scatter between the means measured within each sub- 
sample (e.g., jackknife resampling). However, if one studies 
rare objects (e.g., clusters) or large scales, it is not possible 
to obtain reliable estimates from such subsamplings (be- 
cause of low-quality statistics). Moreover, one often wishes 
to estimate the signal-to-noise ratio of future surveys, even 
before they have been approved by research agencies, in or- 
der to evaluate their scientific possibilities and to compare 
the efficiency of diff erent probes. Then, one must use nu- 
merical simulation s (jPierre et al.ll201ll ICroton et al.ll2003 : 
iKazin et al.ll2010T ) or analytic methods. The former have 
the advantage of greater power (in the sense that one may 
explicitly introduce complex recipes for the formation of the 
objects, such as cooling processes and feedback, or intricate 
survey geometry), but are limited by finite resolution on 
large scales and for rare objects. Analytical approaches al- 
low one to describe a wider range of scales and halo masses, 
and usually provide faster computations. Hence they re- 
main a useful complementary method, which we investigate 
in this paper. 

Thus, in this paper, we present a general analyti- 
cal formalism for computing the shot-noise and sample- 
variance error bars of estimators of number counts and 
real-space two-point correlations for deep surveys that 
cover a significant range of redshifts. We consider both 
the 3D correlation function and the 2D angular correla- 
tion function on the sky. As explained above, this study 
is motivated by the need for such covariance matrices 
to compare any survey with theory. This extends over 
some previous works, which only considered the sample 
variance of number counts (|Hu fc Kravtsovj [20031 ) or ne- 
glected high-order or non-Gaussian terms in the sample 
variance of estimators for two-point correlations or power 
spectra (iFeldman et all 11994 iMaiumdar fc Mohrl 12004: 



lEisenstein et al.l 120051: Icohnl mm ICrocce et al 
Indeed, while the sample variance of number counts (i.e., 
the mean number of objects per unit volume) only involves 
the two-point correlation of the objects, the sample variance 
of estimators of the two-point correlation itself also involves 
the three- and four-point correlations (and so on for estima - 
tors of higher order correlation functions) (|Bernsteinl [l994) . 

Some previous studies have already included the con- 
tributions i_j3f_sjj£h_jfigj[^^ 

matrices (ISzanudi fc ColombilH99fil:lMeiksin fc Whitdfl QQQ: 



Scoccimarro et al.l 119991: lEisenstein fc Zaldarriagal 2001 
Smith! 20091 ). mostly in the context of galaxy surveys 



However, we extend these works by comparing the various 
contributions to expected error bars in detail, including all 
shot-noise and sample- variance terms, as well as high-order 



contributions, and by studying real-space two-point cor- 
relation functions instead of Fourier-space power spectra. 
Moreover, since we have the application to cluster surveys 
in mind, and more generally to deep surveys of rare objects, 
we consider quantities that are defined by integration over 
finite rcdshift bins. For instance, number counts may be as- 
sociated with bins Az = 0.1 while the two-point correlation 
functions are integrated over a significant redshift interval, 
such as < z < 1, to accumulate enough statistics. Then, 
the statistical quantities that we consider involve integra- 
tions along the line-of-sight, rather than local power spectra 
or two-point correlations in a small box at a given redshift. 
Moreover, for number counts we consider arbitrary angular 
scales, from small angles, where the Limber approximation 
applies, to full-sky surveys. In these various respects, our 
study fills a gap in published works. 

In view of the application to cluster surveys, we consider 
a population of objects defined by their mass M, and focus- 
ing on the case of dark matter halos, we use the halo mass 
function and bias measured in previous numerical studies 
for our numerical computations. To estimate the three- and 
four-point correlation functions (needed for the covariance 
of the two-point estimator), we use a simple hierarchical 
model (Peeb ieslll980l : lBernsteinlll994l ). which writes these 
higher order correlations as products of the two-point cor- 
relation. We give the explicit expressions of our results, 
which we also compare with numerical simulations, and we 
provide several realistic illustrations. In particular, we take 
advantage of our analytical formalism to compare the vari- 
ous contributions to the error bars and derive approximate 
scalings. Although we eventually apply our results to sev- 
eral future cluster surveys, our method is more general and 
could be applied to other objects (e.g., galaxies or quasars), 
defined by other quantities (e.g., luminosity), provided one 
has a model for their multiplicity function and their two- 
point correlation, and the three- and four-point correlation 
functions can be described reasonably well by a hierarchical 
model in the regime where they are relevant. 

This paper is organized as follows. In Sect. [2] we first 
briefly describe the analytic models that we use to estimate 
the means and covariance matrices of numbers counts and 
halo correlations, as well as the numerical simulations that 
we use to check the accuracy of our results. Then, we study 
the halo number counts per redshift bins in Sect. [3] This 
allows us to introduce on a simple example our approach 
to evaluate the mean and the covariance matrix of various 
estimators. We consider the cases of both small angular 
windows and arbitrary angular windows (including full-sky 
surveys), and we estimate the accuracy of small-angle ap- 
proximations and the decay in correlations between distant 
redshift bins. Next, we study the real-space 3D halo cor- 
relation function in Sect. |3J We consider both the Peebles 
& Hauser and the Landy & Szalay estimators and com- 
pare their covariance matrices. We also discuss the relative 
importance of different contributions to these covariance 
matrices (shot noise/sample variance, low-order/high-order 
terms). Then, we investigate the halo angular correlation 
in Sect. [5J using the same approach. Finally, we apply our 
formalism to several real survey cases in Sect. El and we 
conclude in Sect. [7] 

We give details of our calculations in several appendices. 
We discuss shot-noise terms in App. [Al in the simple case 
of number counts, and finite-size effects in App. [B] Then, 
we describe our computation of the mean and covariance 
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of the estimators of the 3D halo correlation in Apps. [C] 
and|Dl f° r the Peebles & Hauser estimator, and in App. [E] 
for the Landy & Szalay estimator. We give further details 
on high-order contributions to the covariance matrices in 
App. [I] for the 3D correlation, and App. [Hi for the angular 
correlation. 



2. Halo density fields 

Before we describe our analysis of the covariance matri- 
ces for halo number counts and correlation functions, we 
present in this section the analytic models that we use for 
the underlying halo distributions (mass and bias function, 
etc.) and the numerical simulations that we use to validate 
our results. 

2.1. Analytic models 

2.1.1. Halo mass function and correlation 

To be consistent with the numerical simulations, in 
Sects. [3~T1 to 151 where we develop our formalism and com- 
pare our results with simulat ions, we use the WMAP3 cos- 
mology (|Spergel et all 120071 ). that is, O m = 0.24, n dc = 
0.76, n b = 0.042, h = 0.73, ct 8 = 0.77, n s = 0.958, and 
Wd c = — 1. In Sect. [6j where we apply our formalism to 
obtain forecasts for current and f uture surveys, we us e 
the more recent WMAP7 cosmology (jKomatsu et al .1120 111) , 
that is, Cl m = 0.274, n dc = 0.726, fi b = 0.046, h = 0.702, 
(is = 0.816, n s = 0.968, and Wd c = — 1. 

In this paper, keeping in mind the study of X-ray clus- 
ters, we consider the number counts and correlations of 
dark matter halos defined by the nonlinear density con- 
trast S = 200. These halos are fully characterized by their 
mass, and we do not investigate the relationship between 
this mass and cluster properties such as the gas tempera- 
ture and X-ray luminosity. These scaling laws can be added 
to our formalism to derive the cluster number counts and 
correlations, depending on the quantities that are actually 
measured, but we keep a more general setting in this paper. 

We use the halo mass function, d n/dlnM, of 
ITinker et all (|2008l ). and the halo bias of iTinker et al] 
( 20101 ). Thus, the two-point correlation function be- 
tween two halos labeled "i" and "j" can be factorecQ in 
as 

^=& j 6 3 -e(|x i -x J -|;z), (1) 

where £ is the matter density correlation, and the bias fac- 
tors bi and bj do not depend on scale, 

bi = b{M u Zi). (2) 

This approximation of scale-independent halo bias is valid 
to better than 10% on sca les 20 < r < 130/i _1 Mpc 
(M anera k Gaztan aga 2009}), with a small feature on the 
baryon acoustic scale (r ~ 100/i -1 Mpc) of amplitude of 5% 
(jDesiacaues et al1l2010l ). 



1 A weaker hypothesis would be to write $j = bfj^Xif, z), 
where the dependence on Mi and Mj does not factor in (i.e., 
bij 7^ bibj), but keeping the factorization with respect to the 
relative distance Xij . This only slightly modifies our expressions 
in a straightforward manner. 



Throughout most of this paper we assume that corre- 
lations are negligible over cosmological distances (of or- 
der c/Hq), so that the redshift z on the right-hand side 
of Eq.([T]) can be taken at will as Zj or Zj (or the mean 
(zi + Zj)/2). In Sect. 13.2.21 where we consider the case of 
large angular windows (and go beyond the flat sky and 
Limber's approximations), we do not use this approxima- 
tion but replace the matter density correlation by its lin- 
ear approximation. This yields the alternative factorization 
^ cx b(M i} Zi)b(Mj, z^D+iz^D+izj^Loi^^j) that al- 
lows one to handle arbitrary redshifts z% and Zj . 

For the nonlinear matter correlation £(x;z), and the 
Fourier-space nonlinear power spectrum P{k). defined by 

t;{x;z) = J dke ik x P(fc;z), (3) 

w e use the popular fit ting formula to numerical simulations 
of ISmith et all (120031) . 

Throughout this paper, all angular number densities are 
in units of deg~ 2 . 

2.1.2. Three-point and four-point halo correlations 

The covariance matrices of the estimators £ for the halo 
two-point correlation £ h also involve the halo three-point 
and four-point correlation functions, £ h and rj h , so we must 
define a model for these quantities. On large scales, for 
Gaussian initial conditions, the three- and four-point cor- 
relation functions of the matter densit y field behave as 
C ~ £ 2 and 77 ~ g 3 at low est order over £ (jBernardeau et all 
2002; Go roff et alj|1986t ). On small sc ales, these scaling law s 
remain a reasonable approximation (jColombi et al.l ll996j. 
but with numerical prefactors that are different from the 
large-scale ones (and may slightly vary with scale). On 
the other hand, fo r rare massive clusters, using the stan- 
dard approach of iKaiserl ()1984[ ) where virialized objects 
are i dentified with ove r dense regions in the linear density 
field, iPolitzer k Wisel (|l984j ) obtain 1 + £ h (xi, .., x N ) = 
rii>j[l + £ h ( x «' x j)}- Since our goal is only to estimate the 
magnitude of high-order contributions we consider in this 
article a simple "hierarchical clustering ansatz" , where the 
TV— point correlation function can be expressed in terms of 
products of (N —l) two-point correlation functions th rough 
tree diagrams (|Groth k Peebleslll977l: [Peebles! 1 1980D . This 
is the simplest mode£| that is in qualitative agreement with 
large-scale theoretical predictions and small-scale numeri- 
cal results, as well as with observations. 

Through a comparison with numerical simulations, we 
check that the accuracy of this model is sufficient for our 
purpose, which is to estimate signal-to-noise ratios and 

2 An alternat iv e w ould be to use a halo model 
(|Coorav fc Shethl [2002), coupled to perturbation theory 
predictions on large scales. This could provide more accu- 
rate estimates, since such an approach is able to describe 
low-order corr elation functions well f rom very la r ge to 
small scales ([ Scwximarro et all 120011 : iGiocoli et ah! 120101 : 
Vala geas fc Nishimichil 12010. 201U ). However, this would in- 
troduce several contributions associated with "1-halo" up to 
"4-halo" terms, and would require additional parameters such 
as halo occupation functions and high-order bias parameters, 
depending on the objects that one considers. Therefore, we do 
not investigate this approach in this paper (although it would 
certainly deserve further attention), and we restrict ourselves 
to the simpler hierarchical models, as described in Eqs.([4])-([6]). 
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Fig. 1. The "hierarchical clustering ansatz" for the three- 
point correlation function (j 1 2 3 °f Eq. (0} ■ Each solid line 
corresponds to a two-point correlation £, and ( h is written 
as the sum of these three diagrams, with a multiplicative 
factor 61626353/3. 



compare different survey strategies (while we would require 
a higher accuracy for the computation of the means them- 
selves, that is, the number counts and two-point correla- 
tions that we wish to measure) . The advantage of this sim- 
ple model is that it describes all scales, through the scalings 
recalled above, and does not require additional free param- 
eters. 

iSzapudi fc Colombil 



Thus, 



Bernstein 



<11996h . [MeTksin fc White! (|199 
halo correlation function as 



we write the three-point 



Ci,2,3 = b ib 2 b 3 — [6,26,3 + 6,16,3 + 6,16,2] , 



(4) 



where we sum over all three possible configurations over 
the three halos labeled "1" , "2" , and "3" . This corresponds 
to the three tree-diagrams shown in Fig. [TJ We use a linear 
bias mode0, as in Eq.((T]), and for the matter density nor- 
malization factor S3, we take its l arge-scale limit, which is 
obtained by pertu rbation theory (|Peebleslll980t lFry||1984 
iBernardeau et al.ll2002h . 



S3 



34 

y 



(n + 3), 



(5) 



where n is the slope of the linear power spectrum at the 
scale of interest. Within the same "hierarchical clustering 
ansatz" , the four-point correlation function is expressed in 
terms of products of three two-point functions, as shown 
in Fig. [5J We have two possible topologies, and sixteen dif- 
ferent diagrams for four distinct halos. For simplicity, in 
this work we give the same weight to all sixteen diagrams, 
independently of their topology, as 



Vi,2,3,4 = bib 2 b 3 b 4 y| [6,26,36,4 + 3 eye. 
+6,26,36,4 + 11 eye] , 



(6) 



where "3 eye." and "11 eye." stand for three and eleven 
terms that are obtained from the previous one by permu- 
tations over the labels "1,2,3,4" of the four halos. Again we 
take for S4 its large-scale limit, 



60712 62. . 7. o , 2 
S4 = W - T (» + 3) + 3(» + 3)». 



(7) 



3 Within a local bias model, one writes the halo density field 
as 5h = ~^2 k bk5 k /k\, where 5 is the matter density contrast 
smoothed on large scales. Then, the halo many-body corre- 
lations also depend on the higher order bias coefficients bk 
(Fry & Gaztanaga 1993), but for simplicity we only consider a 
linear bias model here (i.e. bk — for k > 2). 



Fig. 2. The two topologies of the four-point diagrams as- 
sociated with the "hierarchical clustering ansatz" for the 
four-point correlation, as in Eq.([6]). The numbers are the 
multiplicity factors of each diagram. 



This is the simplest possible model, where the angular 
dependence only comes from the decomposition over the 
terms in brackets in Eqs.(U]) and ©. More complex models 
and exact computations at lowest order of perturbation the- 
ory would introduce homogeneous kernels Qjv(xi, .., Xjy) 
(in place of the numbers Sn) that al so depend on the 
angle s between the vectors Xj and x 3 ([Scoccimarro et ail 
1999). Observations of galaxy clustering show, for instance, 
that Q 3 displays a weak dependen ce on the triangle shape , 
while remaining clo se to unity (|Gaztanaga et al.l [2005; 
iKulkarni et al.l l2007tl . Here we simply take the constant 
value Q3 = 53/3. 

In terms of the halo two-point correlation, Eqs.Q 
and (0 imply halo coefficients S% that behave as S% ~ 
b 2 ~ N S N . For 6 - 1 and n ~ -1 and from Eqs.© and ©, 
this gives the values Sf ~ S3 ~ 2.9 and S} ~ 5 4 ~ 13.9, 
which roughly agree with observations of galaxy clusterini 
(ISzapudi et alj|2001t ICroton et all 120041: iRoss et al.ll200T 
Mar in et al.ll200 



2.1.3. Flat-sky and Limber's approximations 

In this paper we often encounter quantities, such as the 
mean matter density correlation over a redshift bin j, 
Zj^- < z' < Zj t +, with respect to some redshift z in a second 
bin i, integrated over some angular window of area (Af2), 



Xj,- 



v{z) J (Any 



(8) 



where x = (x,Df2) and x' = (x',2?'f2'). Here \( z ) anc l 
T>(z) are the comoving radial and angular distances, and 

we introduce the factor 1/V(z) so that £con is dimension- 
less. Equation © is a "conical" average, within the ob- 
servational cone. However, for small angular windows it is 
possible to use a flat-sky approximation and to approxi- 

(i) 

mate this "conical" average 6on by a "cylindrical" average 

6^yi- Thus, using Eq.© and assuming that the correlation 
£ is negligible on cosmological scales, we write for circu- 
lar angular windows of radius S , for a redshift z that also 
belongs to the j-bin, 



<z<z 



Xj,+ , 



dx' fdddO' 



dk 



K ._ V J (^)2 

x e ik r ( x '- x )+iu ± .v(e'-0) p(fc;z)j (Q) 

and ^cyi( z ) = if z does not belong to the j-bin. Here k\\ 
and are the longitudinal and transverse components of 
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k, with respect to the line of sight, while 9 and 8' are the 
2D transverse angular vectors. 

For a redshift binning that is not too small, Ax 3> T>6 S , 
longitudinal wavenumbers above 1 / ( A\) are suppressed 
by integrating \' along the line of sight, and the inte- 
gral is dominated by wavenumbers with k ~ k± and 
k± ~ 1/(T>9„) . Thus, using the Fourier form of Limber's ap- 
proximation (lLimberlll953l). which is widely used in weak- 
lensing studies flKaiserJ [1992J; iMunshi et aT] 12008( 1. the in- 
tegration over \' yields a Dirac term (27r)5_D(fcii), and the 
integration over k» gives 



(10) 



with 

= f / ^ / dk ^ ^ V[e '- e) P(*±; (ii) 

which does not depend on the size of the redshift bin j 
(because we have taken the limit of a very large redshift 
bin). Introducing the 2D Fourier-space circular windowQ, 



w 2 (k ± ve s )= /^ e — - 2 -MJW". 



kiVe s 



we obtain 



£cyl(z) = 7T 



dfc A 2 (fc, z) ~ 



(12) 



(13) 



where we defined the 3D power per logarithmic wavenum- 
ber, A 2 (fc,z), by 



A 2 (fc,z) = 4irk 3 P(k,z). 



(14) 



We will evaluate the accuracy of this approximation, based 
on the flat-sky and Limber's approximation, in Sect. 13.2.31 

2.2. Numerical simulations 

Our analytical formalism allows us to consider a broad 
range of scales and halo masses, from small angular win- 
dows up to full-sky surveys, and to compare the relative 
contributions to covariance matrices that arise from shot- 
noise and cosmic variance effects, and from low-order and 
high-order large-scale correlations. Numerical simulations 
do not easily allow such a detailed analysis; however, in 
order to validate our approach, we must check whether it 
agrees with estimates from simulations, wherever a com- 
parison is possible. 

We use the high-r esolution full-sky Horizon simulation 
(iTevssier et alJ I2009D . based on the WMAP3 cosmology 



Spergel et aDl2007t l. This is a 68.7 billion particle N-body 



simulation, featuring more than 140 billion c ells in the 
AMR grid of the RAMSES code (|Tevssierll2002|) . The sim- 
ulation consists in a lightcone spanning the entire sky up 
to redshift ~ 1, with a mass resolution of 1.1 x 10 10 M©. 

Halos in the (2 G pc) 3 N-body simulation are found with 
the HOP algorithm (jEisenstein fc HutJll998ll . Their comov- 
ing positions are then converted into sky coordinates, tak- 
ing their radial velocity into account when calculating red- 
shifts. The physical effects of the baryons are neglected and 



4 For more complicated angular shapes we can still define a 
Fourier-space window WzfaxDOs): normalized by Vl^O) = 1, 
but it will also depend on the direction of kx- 



the total mass is given by the number of particles inside 
the halo. We only consider halos at rcdshifts z < 0.8, up to 
which the simulation is complete towards all directions. 

We design simulated surveys by extracting rectangu- 
lar fields in angular coordinates. To minimize the effect 
of intrinsic sample correlations, we impose a 10 (resp. 20) 
deg gap between consecutive fields when computing num- 
ber counts (resp. correlation functions), which yields 138 
(resp. 34) nonoverlapping fields that can be cut out in the 
simulation. 

For the purpose of clustering analysis, auxiliary random 
fields are constructed by shuffling the angular coordinates 
of halos in the data fields, thus preserving the halo mass 
and redshift distributions. To gain in computational effi- 
ciency, the number of halos per random field is ten times 
the average number of halos in data fields. The Landy & 
Szalay estimator is then scaled accordingly to the ratios of 
pair numbers in data and random fields. 

The mean and covariance of all quantities of interest are 
estimated by sample averaging over the extracted surveys. 
Because of the uniqueness of the simulation, a residual noise 
is expected whenever the area of individual fields becomes 
large and their number diminishes. Thus, there are about 
41253deg 2 /(AJ7) nonoverlapping fields of area (Af2). (For 
instance, we cut 41 fields of 400 deg 2 for the analysis of the 
angular correlation function.) 



3. Number density of halos 

3.1. Mean number counts in redshift bins 

We consider a population of objects defined by some prop- 
erty, such as their mass M, with a mean comoving number 
density per logarithmic interval of M written as dn/dlnM. 
Then, the mean number of objects in the redshift interval 
[z,z + dz], within the solid angle dCl on the sky, with a 
mass in the range [M, M + dM] , reads as 



dCl dN = dz d£l 



dV 



dzdfl 



dM 



dn 



M dlnM 



(M,z) 



(15) 



where |dy/dzd$7| is the cosmological volume factor, which 
is given by 



dV 



dzdfl 



(16) 



and x( z ) an d 2? (z) are the comoving radial and angular 
distances. In Ea. ([l"5")) and in the following we define N as 
the number density of objects per unit area on the sky, in- 
stead of the total number of objects within a given window 
(AO). This choice is more convenient for practical purposes 
because it allows a simpler comparison between different 
surveys that have different total areas. 

We can split the interval of mas^B over several bins "a" , 
[M a _,M ai +] and the observational cone over nonoverlap- 
ping redshift intervals "i", [zi,_, Zj +] with Zj + < In 
practice, one usually takes Zi+ = Zi+i - so as to cover a 



5 Although M stands for the mass of the objects, as for the 
mass function of clusters of galaxies, or of galaxies themselves, 
it can also represent any other quantity, such as temperature, 
luminosity, or a vector made of several such quantities. 
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continuous range of redshifts. Then, the number of objects 
per unit area, in the bin (i,a), reads as 



r z i,+ 



dz — V 

dz 



/AO (AO) /»,„,_(,) M d In M } 



where dn/dlnM is the observed density of objects. Here 
and in the following, we note observed quantities by a hat 
(i.e. in one realization of the sky) to distinguish them from 
mean quantities, such as the comoving number density of 
Eq. (|15[) . that correspond to expectation values over many 
realizations. In practice, assuming ergodicity, these expec- 
tation averages are assumed to be identical to volume aver- 
ages (in the case of statistically homogeneous and isotropic 
cosmologies). 

To simplify notations we define the mean cumulative 
number density of objects observed at a given redshift, 
within the mass bin a (with boundaries that may depend 
on z), 



, ,M °' +W dM dn ,„ . 



(18) 



and we omit the explicit boundaries on mass in the follow- 
ing. Then, the mean number of objects per unit area in the 
redshift and mass bins (i, a) reads as 



(Ni, a ) = [ ,,+ d X V 2 n a . 



(19) 



r:. 



M>2xl0 13 /j- 



M>10 l4 h-'M,- 



M>5xlO l4 A- , M„ 



Fig. 3. The mean number density of dark matter halos per 
square degree, within redshift bins of width Az = 0.1. We 
count all halos above the thresholds M* — 2 x 10 13 ,10 14 , 
and 5 x 10 14 /i _1 M Q , from top down to bottom. We com- 
pare our analytical results (solid lines) with numerical sim- 
ulations (dashed lines). 



We plot the mean number counts (Ni >a } of Ea. (fT9|) 
in Fig. [3J per square degree, for redshift bins of width 
Az = 0.1. Here we select all dark matter halos above 
a mass threshold M*, with M, = 2 X 10 13 , 10 14 , and 
5 x 10 14 /i~ 1 M Q . The error bars are the 3 — a statistical 
errors obtained from the covariance matrices derived in 



Sect. 13.2.11 for 138 fields of 50 deg 2 as used in the sim- 
ulations. We can check that our estimates agree reason- 
ably well with the numerical results. The small discrep- 
ancies are probably due to the complex relation between 
theoretical halo masses and the actual halos found in the 
simulation box. In particular, it is well known that using 
different algorithms, such as spherical-overdensity, HOP or 
friends-of-fr iends algorithms, can le a d to slightly differe nt 
results (e.g. lEisenstein fc Hutl (j 1 998ft : iTinker et all (|2008h ). 
However, this point is beyond the scope of this paper, as we 
only wish here to check that our analytical results provide 
reasonable estimates of the mean and covariance of halo 
number counts and correlations. 

3.2. Covariance of number counts 

As usual we define the covariance Ci yCC -j^ of the statistical 
quantities N^a and Nj,p by 

C i>a .j,p = (Ni, a N j>p ) - (Ni, a ){N jt p). (20) 

As recalled in App. [3] following iPeeblesI (|1980h . it can be 

decomposed over "shot-noise" and "sample-variance" con- 
tributions, 



c 



(s.v.) 



which write from Eq. (|A.10[) as 



C 



(s.n.) 



$ a. 



(for nonover lapping mass binning), and 



(Ni,a) 



(21) 



(22) 



C 



(s.v.) 



d X V< 



dn 



dM dn 



(Aft) J a M dlnM 



(Aft) J M> dlnM' 



Here we denote f. and J as the integrals over the red- 
shift and mass bins i and a. The superscript "h" refers to 
the "halo" correlation function, which depends on the two 
redshifts, angular directions, and masses (or temperatures, 
etc.), £ h = £ h (M,x; M',x'), with x = (x,£>f2). 

The first term C, „ n '-^ is the shot-noise contribution and 
vanishes for nonoverlapping bins. As expected it decreases 
with the survey size as l/(Aft). (We recall that N is the 
angular number density.) The second term C^^^g is due 

to the "samp le- variance" cross-cor relation £ h between dis- 
tant objects (|Hu fc Kravtsovll2003l) . Using the approxima- 
tion (JJJ, that is, the factorization of the dependence on mass 
and distance of £ h , we define the mean bias b a at redshift 
z, for the mass bin a, through 



, dM , . , , , dn , , , . 
b a {z)n a (z) = I —b(M,z)-^jj(M,z), 



(24) 



where n a was defined in Eg. (ITS)) , so that Ea. (|2"3"| also writes 
as 



C 



(s.v.) 



<h V- b„TJ, t I d\ ' V" b ,77 ; / £(x, x'). 

(25) 
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3.2.1. Small angular windows 

Using the approximation that the correlation function is 

negligible on cosmological scales, whence V ~ T>,b ~ b 
and n ~ n, we obtain 



C 



(s.v.) 



(26) 



r<( s - v -) _ X 



dx T> 5 b a bp n a np Zcyi, 



(27) 



"O Q.l 



(Af!)-50 deg 2 



M>2x 10 13 h-'M c 



M> 10 14 i7 -'M,. 



where £con(z) was denned in Eq.©. It is a "conical" aver- 
age, over objects "i" and "j" that are located at unrelated 
positions (x, H) and (x',Sl'). To recall this "conical" inte- 
gration along x' we have added the subscript "con". This 
helps distinguishing quantities such as © from other av- 
erages of £ over 3D spherical shells, which we encounter in 
Sect. H] below. To further distinguish from the quantities 
encountered in Sects. 0] and [5j we put the label j, which 
refers to a redshift bin, as a superscript, whereas the labels 
i or j of the radial or angular bins studied in Sects. 0] and 
[3] appear as indices. (The parenthesis refer to the fact that 
in Limber's approximation of wide redshift bins the depen- 
dence on the boundaries of the bin j disappears, because 
they are pushed to infinity, see Eo. flTBl .') 

In the case where the nonoverlapping redshift bins are 
large enough to neglect the cross-correlation between dif- 
ferent bins (we evaluate the accuracy of this approximation 
in Sect. 13.2.41) . the integral (JSJ) gives rise to a Kronecker 
factor Sij. Moreover, for small angular windows and large ^— 1 
enough redshift bins we can use the flat-sky approxima- 
tion where the "conical" average is approximated by 
a "cylindrical" average (and spherical harmonics are re- 
placed by plane waves), and Limber's approximations ([TU)) . 
where longitudinal wavenumbers are neglected over trans- 
verse wavenu mbers. Substitu t ing in to Eq. (|2"6")l we recover 
the results of iHu fc Kravtsovl (|2003h . 



M>5xlO"h-'M r 



G - 



Fig. 4. The variance ON i of the halo angular number den- 
sities of Fig. [3j for redshift bins Az = 0.1 and an angular 
window of 50 deg 2 . We compare our analytical results (solid 
lines) with numerical simulations (dashed lines). 




where £ C yi(z) was defined in Eq. dTBl . (We evaluate the accu- 
racy of the approximation (|2~T1) in Sect. 13.231 ) Thus, while 
the shot-noise contribution ((22]) to the covariance matrix 
is diagonal, the sample-variance contribution (1271) is only 
block-diagonal (for large redshift bins) since within the 
same redshift bin different mass bins are correlated. 

In Fig. 2] we compare with the numerical simulations 
our results for the variance of the halo angular num- 
ber densities, where includes both the shot-noise and 
sample- variance contributions (|21[) . 



G Ni 



a 



(s.n.) 



(s.v.) 



(28) 



We consider an angular window of 50 deg 2 , which cor- 
responds for inst ance to the case of the XXL survey 
(|Pierre et al.ll201lt ). As for the mean densities of Fig. [3J we 
obtain a reasonable agreement with the simulations, and 
we correctly reproduce the dependence on halo mass and 
redshift. 

Taking advantage of our analytical model, we compare 
in Fig. [S] the shot-noise and sample- variance contributions 
to the total error that was displayed in Fig. 01 but going up 
to redshift z — 2. Here we define 



<c 



(s.n.) 



a 



(s.v.) 

Ni 



c 



(s.v.) 



(29) 



Fig. 5. The shot-noise (dashed lines) and sample-variance 
(solid lines) errors for the angular number densities shown 
in Fig. [3J associated with a redshift binning of width Az = 
0.1, but up to z = 2, and an angular window of 50 deg 2 . 

As expected, we can see that the error of observed number 
counts Ni is dominated by the shot-noise contribution for 
rare halos (high mass or high redshift), where effects as- 
sociated with the discreteness of the halo distribution are 
very important. 

Signal-to-noise ratio 

From the angular number density (Ni) and its variance 
<7Ni we define the signal-to- noise ratio as 



S_ 

N 



VNi 



(30) 



which we compute from Eqs. (fT^)) and (|2"5)) . Thus, combin- 
ing Figs. [3] and SI we display in Fig. [5] this signal-to-noise 
ratio. In agreement with these previous figures, we obtain a 
good match to the numerical simulations. Thus, the analyt- 
ical results are competitive with the numerical simulations 
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(An)=50 cleg 3 



M>2xl0 13 ii- 1 M e 
I 1 .- 



M>10 14 A- I M_ 



M>5xl0 14 h 'M„ 



o ; 
z 



Fig. 6. The signal-to-noise ratios of number counts for an 
angular area AO = 50 deg 2 , as in Figs. [3] and 01 We com- 
pare our analytical results (solid lines) with numerical sim- 
ulations (dashed lines). 



since they appear to be no less reliable and much faster to 
compute. 

Scalings with survey area and number of subfields 

For practical purposes it is interesting to compare the 
signal-to-noise ratios associated with a different number 
of subfields at fixed total area AO, since this can help 
for choosing the best observational strategy, whether one 
should perform a single wide-field survey or several smaller 
scale surveys. 




Fig. 7. The signal-to- noise ratios of number counts for a to- 
tal angular area AO = 50 deg 2 , divided over Af independent 
subfields. We show the results obtained for the numbers of 
subfields TV = 1 (solid lines), 2 (dashed lines), and 4 (dotted 
lines) . 



Therefore, let us consider a survey with a total angu- 
lar window of area AO that can be split over Af angular 
subfields, which we assume to be independent and to have 
equal area AO /Af. For instance, the survey may be made of 



Af smaller regions that are well separated on the sky. Then, 
the total angular number density TV* ' of objects in the red- 
shift bin [zi,-, 2t,+]) summed over the Af smaller subfields 
of index a with angular number densities , writes as 



(31) 



Since all subfields arc independent and have the same depth 
we have, for any a, 



(«)\ 



(32) 



which depends neither on (AO) nor Af. Without mass bin- 
ning the covariance matrix remains diagonal, see Eq (|27[) . 
with a shot-noise contribution 



r tot,(s.n.) _ J_ r (a),(s.ix.) _ (N- ot ) 

TV l ' 1 (AO) 



cx (AO)" 1 , 



while the sample- variance contribution is 



a 



tot,(s.v.) 



J_ r (a),(s.v.) 

Af '* 



(33) 



(34) 



where £ C yi,(a) i s the typical value of the integral ([TcH) within 
the small angular window (AO) /Af. Then, the signal-to- 
noise ratio scales as 



S_ 

N 



(AO) 



(An) 



£cyl,(a) 



(35) 



In the regime where the covariance is dominated by the 
shot-noise contribution, the signal-to-noise ratio grows as 
the square root of the total area and does not depend on 
the number of subfields, (S/N) cx V AO. 

To estimate the scaling in the regime where the co- 
variance is dominated by the sample-variance contribu- 
tion, we assume that on the relevant scale k ~ l/(T>6 s ), 
that is, k ~ yAf] (I? 2 AO), the power spectrum behaves 
as P(k) ~ k n , with —2 < n < 1. (Wavenumbers where 
n < — 2 for CDM cosmologies would correspond to very 
small angular windows.) Then, from Ea. lfTS")) we obtain 
£ cylj(Q) - k n+2 ~ (7V/AO)("+ 2 )/ 2 , and the signal-to-noise 
ratio scales as (S/N) - Af~ n/i (AO)("+ 2 )/ 4 . Thus, in this 
regime the signal-to-noise ratio still grows with the total 
survey area, but there is a weak dependence on the number 
of subfields, which may either increase or decrease with Af 
depending on the sign of n. 

For illustration, we show in Fig. [7] the signal-to-noise 
ratios of the number counts obtained for a total angular 
area AO = 50 deg 2 , divided over Af subfields with Af = 1, 
2, and 4. The case of a single field, Af = 1, corresponds 
to Figs. [3] and |U The curves in Fig. [7] are exact estimates 

of the signal-to-noise ratios, obtained from (N- ot ) / J C^f' 
and not from the approximate scaling given in Eg. (1351) . 

In agreement with the discussion above and with Fig. 01 
we can check that at high redshift or for high mass, the 
signal-to-noise ratio does not depend on Af, since the error 
is dominated by the shot-noise contribution, which only 
depends on the total area as seen in Eas. (|55)l and (pJ5"j) . 



8 



P. Valageas et al.: Covariance matrices for halo number counts and correlation functions 



At low redshift and low mass, where the error is dom- 
inated by the sample-variance contribution, the signal-to- 
noise ratio increases slightly with Af. This can be under- 
stood from the fact that the local slope n of the power spec- 
trum is slightly negative on the scales where the halo cor- 
relation is significant. For instance, within the ACDM cos- 
mology that we consider in this paper, a window of area 50 
deg 2 corresponds at z = 1 to a radius T>0 S = 164/i~ 1 Mpc, 
and the local slope n(k) of the linear power spectrum, 
at wavenumber k ~ 27r/(164/i _1 Mpc), is n ~ —0.6. This 
means that for the number counts at low redshift and low 
masses, it is slightly advantageous to choose a survey di- 
vided over several independent subfields. 

As shown by Figs. 11.11 and 11.21 in App. HI these scalings 
are approximately satisfied by the results obtained from 
numerical simulations, for a wide variety of survey area and 
of number of subfields. Therefore, the scalings derived from 
Eq. (j3"5)) allow a reasonable estimate of the dependence of 
the signal-to-noise ratio of number counts with (Af2) and 
JV. 



3.2.2. Large angular windows 



10 5 j 
10- 1 - 



10 3 w 



O. 1 s- 
10" 
10- 



M>2xlO' a h- | M r 



M> 10 14 /3 -'M, 



exact 
limber 



M>5xlO«i-iM r 




Fig. 8. The angular power spectrum of the distribution of 
halos in the redshift bin 0.95 < z < 1.05. We plot both the 
exact result (1381) (solid line) and Limber's approximation 
([39]) (dotted line). 



Expression (|27[) relies on the approximation of small an- 
gular windows for the sample-variance contribution. This 
allowed us to use the flat-sky approximation (|9]), where the 
observational cone over the redshift bin Zi +] is ap- 

proximated as a cylinder of radius 9 S around the central line 
of sight. For large angular windows this approximation is 
no long er valid and we must d ecompose over spherical har- 
monics (|Hu fc Kravtsovl l2003). rather than over the plane 
waves of Eq.©. 

Rather than using the Eqs. ([2"5|) or (|26p . with a new ex- 
pression for ^con(^) that would remain valid for large an- 
gles, it is more convenient to go back to the angular number 
densities N, as in Eg . (fT5|) . Thus, for any redshift bin i we 



expand the observed distribution iVj(n) on the sky over the 
spherical harmonics, 



and we define the angular power spectrum as 



(£,m) 



"N, 



)^(s.v.) _ ^ ^ § mm , C 



(36) 



(37) 



where we only take the sample-variance contribution. To 
simplify notations we do not include mass binning, but 
this can be added without difficulty, as in Sect. 13.2.11 
Then, writing the two-point correlation function again un- 
der the factored form JT]), introducing the Fourier-space 
power spectrum as in Eq.Q and expanding the plane- wave 
exponential fac tor over spherical harmonics, a s tandard cal- 
culation gives (|Hul 120001: iHu fc Kravtsovll2003j) 



a 



= An 



dk 
k 



Aio(fc) d X x 2 bnD +n (k X ) 



x / d X 'x' 2 bn'D' + j e (kx'), 



(38) 



where ji is the spherical Bessel function of order I. Here 
we assumed for simplicity a flat background, which is suf- 
ficient for practical purposes, and we approximated the 
Fourier-space power spectrum by the linear power spec- 
trum, P(k,z) ~ D + (z) 2 PLo(k), where D + (z) is the linear 
growth rate (normalized to unity at z — 0). 

Limber's approximation can be recovered in the limit of 
large £, for slowly varying fc-dependent prefactors, by using 
the property / dfc k 2 jz{kx)jt{kx') = tt/( 2x 2 )S d (x - y'), 
and the correspondence k (£ + 1/2)/% ()Hu fc Kravtsovl 
120031: lLo Verde fc Afshordl l2008). This yields 



C 



Limbc 



= & 



d X X 5 b n 2 



2tt 2 



A 



£ + 1/2 



(39) 



Nevertheless, because the structures of Eas. (l3"51) and ([2^| 
are quite different (the order of the integrations over red- 
shift and wavenumber is exchanged, and the large-angle ex- 
pression keeps two integrations over redshift, while in the 
small- angle expression one integral over redshift has already 
been performed) it is more convenient to treat the small- 
angle and large-angle derivations separately. 

We plot in Fig.[8]the angular power spectrum Ci t i-i ob- 
tained for halos above three mass thresholds in the redshift 
bin 0.95 < z < 1.05. We can see that Limber's approxima- 
tion (j39|) significantly underestimates the power at low £, 
while it slightly overestimates the power at high multipoles, 
£ > 20. It is already rather good at £ ~ 20, and becomes 
increasingly accurate at higher £, although the difference 
remains on the order of 10% until £ ~ 80. These results 
agree with previous s tudies of the Lim ber ap proximation 
(lLoVerde fc Afsh.ordill2008t ICrocce et al.ll201ll) . As noticed 
in lLoVerde fc Afshordil (|2008l) . the latter can be extended 
as a series expansion over (I + 1/2)" 1 , but higher orders 
behave increasingly badly at low £. Therefore, we do not 
investigate further this approach here, since using the exact 
expression Q38p is not more difficult (but slower) to compute 
and ensures a smooth behavior over all £, while the usual 
Limber approximation (|39[) is sufficient for our purposes on 
small scales. 
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sample variance 
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Fig. 9. The shot-noise (dashed line) and sample-variance 
errors (l29t for the angular number densities in the redshift 
bin 0.95 < z < 1.05, as a function of the radius 9 S of the 
angular window. The solid line is the exact sample variance, 
from Eas. ((3"8[) and (I4T1) . while the dotted line is the result 
(|2"T)l . which was used in Fig.[S]and involves both the fiat-sky 
and Limber's approximations. 



Next, the mean angular number densities Ni of Eq. flTTl) . 
smoothed over the angular window of radius 9 S and filter 
W 2 (tt), read as 



Ni 



dnNi(n)w 2 (n), 



(40) 



where Wa(fi) = l/(Af2) within the angular window and 
vanishes outside (but we can choose more general filters). 
Th en, the sample varian ce of these number counts writes 
as (|Hu fc Kravtsovll200l 



,™)|2 



i.m 



(41) 



while 



where Cij-i is the angular power spectrum 

are the angular multipoles of the window W 2 , 

W\ l - m) = J dCl W 2 (£l) Y e m (n)*. (42) 

For a top-hat window that is symmetric around the az- 
imuthal axis, we have for I > 1, 



W, 



2tt 



d9 sm9Y e °(9) 



tt Pe-i(cos9 s ) - P e+1 (cos9 s ) 



21+1 



(An) 



and 



W. 



(0,0) _ 



and (AQ) = 2tt(1 - cos 9 S ), 



(43) 
(44) 

(45) 
= 



where Pi are the Legendre polynomials, and W 2 ' ' 
for m^0. 

In the limit of large I, Y e °(9) ~ yJ^^J [(£ + 1/2)9} 
(|Hull2000h . and we obtain \W { 2 m \ 2 ~ W 2 (k X 9 s ) 2 , with 



k = (£+1/2) jx and W 2 the 2D Fourier-space window (fH)) . 
This shows that for small angles, where the covariance is 
dominated by large I, the expression (|4T|) goes to the flat- 
sky approximation (|27[) . using the fact that Limber's ap- 
proximation (I3U1) also applies in this limit (see Fig. 

We plot in Fig . |H] the shot-noise and sample- variance 
errors <Ji — y/Ci,i, as in Fig. 0] but as a function of the 
angular radius 9 S , for the angular number densities in the 
redshift bin 0.95 < z < 1.05. In agreement with Fig. [51 
we can check that the combination (|27|1 of the flat-sky & 
Limber's approximations provides a good approximation 
to the exact result (|4T|) on small angles, typically 9 S < 10 
deg, while it underestimates the sample variance on large 
angles. In agreement with Fig. [SJ the shot-noise contribu- 
tion is dominant for massive and rare halos, and subdomi- 
nant for small and numerous halos. In our case, the transi- 
tion between the shot-noise and sample- variance dominated 
regimes takes place at Ai* ~ 10 14 h~ 1 M Q . 

3.2.3. Accuracy of the "flat-sky + Limber" approximation 
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Fig. 10. The ratio <r££t / (T^ Umhci of the exact sample- 
variance error (|4"T]) to the approximation ([27[l. which uses 
both the flat-sky and Limber's approximations. We show 
this ratio as a function of the radius 9 S of the angular win- 
dow, for several redshift bins, for halos above the mass 
threshold M > 10 14 /i _1 Af Q . Higher z corresponds to a 
higher ratio. 



We plot in Fig. [TU] the ratio of the exact sample- variance 
error (j4Tj) to the approximation ((27)) . which used both the 
flat-sky and Limber's approximations. As in Figs. [5] and 
H3 we can check that the approximation (|27)1 is reliable 
for small angular windows but significantly underestimate 
the sample-variance error for wide angles, above 10 deg. 
In the extreme case of full-sky surveys (9 S — 180 deg), 
it can underestimate the sample-variance error by a fac- 
tor from 2 to 5. The effect is actually greater for higher 
redshift bins. This may seem somewhat surprising since 
the "flat-sky" approximation ([9]) is expected to be more 
accurate at higher redshifts, where large angles 9 corre- 
spond to large distances T>9 that are weakly correlated and 
should not significantly contribute (i.e., the CDM power 
spectrum itself yields more weight to pairs separated by 
small angles). However, Limber's approximation (|10[) goes 
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in the opposite direction because it relies on the assump- 
tion A% 3> T>9 S (i.e., longitudinal wavenumbers are more 
suppressed by the integration along the line of sight than 
transverse wavenumbers, which are only integrated over the 
smaller angular distance T>0 S ). For instance, for an Einstcin- 
de Sitter universe, where \ = = 2c/iJ [l — (1 + z)^ 1 / 2 }, 
this constraint on the angle 9 S and the redshift bin width 
Az writes as 9 S < 29(Az)[(l+z) 3 / 2 -(l+z)]- 1 deg. At fixed 
Az this upper bound on 9 S becomes stronger at higher z. 
Therefore, Fig. HU1 shows that this second effect, associated 
with Limber's approximation, dominates over the first ef- 
fect, associated with the flat-sky approximation. 

We checked that we obtain very close results for other 
mass thresholds (not shown in the figure). For instance, 
the curves obtained for the mass threshold 2 x 10 13 and 
5 x 10 14 ft- _1 M Q cannot be distinguished from those plotted 
in Fig. 1101 This is not surprising since within our bias model 
|T]) the halo correlations are governed by the same matter 
density correlation function £(r; z). 

Figure [10] shows that the small-angle approximation 
(|2"7| that we used in Sect. 13.2.11 was legitimate, since we 
considered angular windows of 50 deg 2 or less (i.e. 9 S ^ 4 
deg). 

3.2.4. Correlation between different redshift bins 




Fig. 11. The correlation matrix W^'J of Ea. (|4"6"]) between 
redshift bins of width Az = 0.1. We show Wt'i as a func- 



(s.v.) = 



1 at 



tion of j, for four values of i. In each case, 1Z 

j = i. We consider halos above lO 14 ^ -1 Mq and an angular 
window of area (Aft) = 50 deg 2 . 



We can use the expression (|4"Tj) to compute the correla- 
tion between different redshift bins i and j. Thus, we show 
in Fig.[TT]the correlation matrix TZf'^^ (also called normal- 
ized covariance matrix) defined as 
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(s.v.) 



c 



(s.v.) 

hj 



^(s.v.Jpts.v.) 



(46) 



where we only consider the sample-variance contribution. 
(The shot-noise contribution (|22p is always diagonal for 



nonoverlapping redshift bins.) Thus, TZ^j- is unity along 

the diagonal and elements {i,j} where 7Z| S . V is much 
smaller than one are weakly correlated. We can check that 
the decay is always rather fast and correlations between 
neighboring redshift bins, j = i ± 1, are already below 
10%. This shows that it is appropriate to neglect cross- 
correlations between redshift bins of width Az = 0.1, as 
we did in Sect. 13.2.11 We also checked that we obtain al- 
most identical results for other angular windows, such as 
(Aft) = 400 deg 2 . (For very large or full-sky surveys we do 
not need the approximation of uncorrelated redshift bins 
since we use Ea. (j4i"j) .') 



4. Real-space two-point correlation function 

In the previous section we have studied the covariance of 
the estimators Ni, which measure the redshift distribution 
dn/dz of the population of interest (galaxies, clusters, etc.), 
over a set of finite redshift bins. This corresponds to one- 
point statistics. We now study estimators of the real-space 
two-point correlation function £(£12; Mi, M2; z) of these ob- 
jects, which corresponds to two-point statistics, as a func- 
tion of the comoving distance x\2- In this article we do 
not investigate redshift-space distortions, which we leave 
for future works, and we assume that a real-space map of 
the population under study is available or that redshift dis- 
tortions can be neglected. 

Estimators of 3D correlation functions, or power spec- 
tra, have already been studied in many works, mostly in 
view of their application to galaxy surveys. However, since 
we have in mind the application to cluster surveys and, 
more generally, to deep surveys of rare objects, we consider 
3D correlation functions averaged over a wide redshift bin 
(in order to accumulate a large enough number of objects), 
rather than the usual local 3D correlation functions at a 
given redshift. This means that the quantities that we con- 
sider in this section, while being truly 3D correlations and 
not 2D angular correlations, nevertheless involve integra- 
tions along the line of sight or, more precisely, the obser- 
vational cone, within a finite redshift interval. This is also 
why 3D Fourier-space power spectra may not be the most 
convenient tool for our purposes, since we do not have ho- 
mogeneous and isotropic distributions since the radial di- 
rection plays a special role. 

4.1. Mean correlation 

4.1.1. Peebles & Hauser estimator 



Following lPeebles fc Hauserl (|l974h . a simple estimator £ for 
the two-point correlation function of a point distribution is 
given by 



1=^-1, 
^ RR 



(47) 



where D represents the data field and R an independent 
Poisson distribution, both with the same mean density. 
More precisely, the estimator ^ introduced in (|47|) for 
the mean correlation over the radial bin [Ri,-, Ri,+] cor- 
responds to counting all pairs "DD" in the data field that 
fall in this pair-separation bin i and all pairs "RR" in the 
auxiliary Poisson field that fall in the same bin, and to 
taking the ratio of these two counts. 
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Before appropriate rescaling, the mean number density 
of the actual Poisson process R is taken as much higher 
than the observed one, so that the contribution from fluc- 
tuations of the denominator RR to the noise of £ can be 
ignored. The advantage of form (14"T|) is that one automati- 
cally includes the geometry of the survey (including bound- 
ary effects, cuts, etc.), because the auxiliary field R is drawn 
on the same geometry. 

In our case, we write the analog £j of Eq. (|4"T| for the 
mean correlation on scales delimited by Ri,- and , in- 
tegrated over some redshift range and mass intervals, as 



1 + 



with 



1 / 


dz 




Qi;a,/3 J 


dz 








X / 


dr' 


I 


JRi,- 







V 1 



dM' 



dfi 

(AO) 

dh 



dM 
dn 



M' dlnMdlnM' 



(48) 



dfl f dM f R ' + , , f dM' 



z j (AO) J a M jRi 
dn dn 



dlnMdlnM' 



(49) 



Here we denoted J^'' + dr' as the integral over the 3D spher- 
ical shell of radii < and J a and are the inte- 
grals over the mass bins a and j3. 

The redshift interval Az is not necessarily small, and 
to increase the statistics we can choose the whole redshift 
range of the survey, such as [0, z s ]. If we bin the survey 
over smaller nonover lapping redshift intervals, which are 
large enough to neglect cross-correlations between different 
bins (see for instance Fig. [11]), we can independently study 
each redshift bin. For simplicity we do not explicitly write 
the redshift boundaries. 

As in Eq.flHJ), the counting meth od that underlies 
Ea.(l48l) can be understood as follows (jPeebles fc Hauserl 
1974). We span all objects in the "volume" (z, n,lnM), 
and count all neighbors at distance r', within the shell 
[Ri,-, Ri,+], with a mass M', We denote with unprimed 
letters the quantities associated with the first object, 
(z, n,lnM), and with primed letters the quantities asso- 
ciated with the neighbor of mass M' at distance r'. Thus, 
with obvious notations, dn/dlnM and dn/dlnM' are the 
observed number densities at the first and second (neigh- 
boring) points. The difference between the quantities (l+£) 
and Q is that in the latter case we use the mean number 
densities dn/dlnM and dn/dlnM'. Therefore, Q is not a 
random quantity so it shows no noise. In practice, the mean 
number densities dn/d In M may actually be measured from 
the same survey, as described in Sect. [3l However, since 
these measures do not involve a distance binning over r' , 
there are many more objects in a redshift bin than within 
a small interval [Ri.-, Ri,+]- Then, the one-point quantities 
(N) are measured with much greater accuracy than (£) , so 
that we can indeed neglect their contribution to the noise of 
the estimator £. In terms of Eq. (|4"T)) this corresponds to ne- 
glecting fluctuations of "RR" . (This is achieved in practice 
by choosing a much higher density for the field R, which is 
later rescaled.) 

In Eq. (j4"5)) we used a simple average over the shell 
[Ri,-, Ri,+], because we count all pairs with a uniform 



weight in r'-space. Through the change to spherical coordi- 
nates dr' = dr'r' 2 dJ7', this yields a geometrical weight r' 2 
in terms of the radial distance r' . An alternative would be 
to add a weight r'~ 2 , instead of the simple 3D top- hat writ- 
ten in Eq. (|48[) . to eventually obtain a uniform weight over 
the radial distance r' . For simplicity we only consider choice 
(|4"5|) in the following, but such alternative weights could be 
used with straightforward modifications in the expressions 
given below. 

Thus, we focus on the behavior of the two-point corre- 
lation as a function of distance r', measured through the 
binning over the intervals [Ri,-, Ri,+\- We assume that dif- 
ferent bins do not overlap, < and in practice 
one usually has = R4+1.-, to cover a continuous range 
of scales. On the other hand, these intervals may depend on 
redshift, as long as Ri t +(z) < i?i+i i _(z) at each redshift. 

Using Eq.(IHl), Eq.(|4H]) also writes as 



(50) 



Qi;a,0 = J dxV> 2 n a npVi 



where the volume V, of the i-shcll is 



4-7T 

V t (z) = Y [R h+ (z) 3 ~ R^(z) 3 



(51) 



which may depend on z. In practice, one would usually 
choose constant comoving shells, so that Vi does not de- 
pend on z. To obtain Eq. (|50|) we used that dn/dlnM and 
dn/dlnM' have no scale dependence (because they cor- 
respond to a uniform distribution of objects) and we ne- 
glected edge effects. (These finite-size effects are discussed 
and evaluated in App.[Bj) 

Because of the finite distance r' between the two 
objects M and M' in Eq. (l48l) . there is no shot- 
noise contribution to the average of the quadratic term 
(dn/dlnM) x (dn/dlnAf). Within the framework pre- 
sented for Eos. (|A.l[) - (|A.2p . the integration in Eq. (|48]) does 
not contain common small (infinitesimal) cells, because of 
the finite-size distance r' > Ri-- Therefore, the average of 
the statistical estimator (l48t reads as 



1 



dn 



d x V 2 



dn 



dn 



dM f , , f dM' 
dr ' M^ 



(AO) J a M 
(i[nl/(lhiM7 [l + £V;M,M';z)], (52) 



where £ h (r'; M, M'; z) is the two-point correlation function 
of the objects, as in Eq. (|A.9p . Here we denoted f. dr' as the 

integral Jj^*' + dr' over the 3D spherical shell i. Comparing 

with Eo. (|5"0"|) we clearly see that ^ is an unbiased estimator 
of the two-point correlation function £ h , averaged over the 
shell [Ri,-, Ri,+] (with a geometrical weight r' 2 ), whence 

the name . 

As in Eq. (|50"]) . in Eq.([5l2j) and in the following we neglect 
finite-size effects, which arise because the integration over r' 
should be restricted to the observational cone of the survey. 
This leads to a smaller available volume than the spherical 
shell [Ri-,Ri y+ ] close to the survey boundaries. This does 

not affect the mean value of the estimator £ , , because this 
effect cancels out between the numerator of Eq. (15^1) and the 
denominator Qi. However, it will have a small effect on our 
estimate of the covariance matrix. As described in App. [Bj 
at z = 1 for a circular survey area AO = 50deg 2 , and for 
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a radial bin at r = 30/i _1 Mpc, by geometrical counting we 
overestimate the number of pairs by 10% and the signal-to- 
noise ratio by 5%. 

As in Sect. 13.21 in order to make progress we assume that 
the two-point correlation can be factored in as in Eq.([T]), 
so that Eq. (|5"2l reads as 



1 



Qi;a,8 



dr' 



dx T> 2 b a b(3 n a n/3 V» £,P (z 



(53) 



(54) 



(£i;a,p) - 
with 

We have introduced the superscript "(r)" to recall that 
Ea.([54|) is the radial average of £, over the 3D spherical shell 
associated with the radial bin i, to distinguish it from the 
angular averages that we encounter in Sect. [S] below. The 
prime in the subscript V also recalls that we integrate 
over a neighboring point r', with respect to a given point 
(X,2?n) of the observational cone, to distinguish it from 
the integration over an unrelated point within the observa- 
tional cone as in the "cylindrical" average ©. We give in 

Eq. (|C.3p in App. [C]the Fourier-space expression of Q,(z), 
which is more convenient for numerical computations. 



0<z<0.8 
(AQ) = 50 cleg 3 




M>2xlO l3 ii- 1 M 



r [ h-'Mpc ] 



10 2 



Fig. 12. The mean halo correlation, (£, LS ), over ten co- 
moving distance bins within 5 < r < lOO/i^Mpc, equally 
spaced in log(r). We integrate over halos within the redshift 
interval < z < 0.8 and we compare our analytical results 
(solid lines) with numerical simulations (dashed lines). 



4.1.2. Landy & Szalay estimator 

As shown in lLandv fc Szalavl (|1993l ). a better estimator 
than l[4"7]) is given by 

DD - 2DR + RR 



LS 



RR 



(55) 



which involves the product DR between the data and the 
auxiliary field. Within our framework, where the mean 
quantity Q plays the role of R 7 this second estimator reads 
as 

Pls 1 r . dy _o r dfl f dM f . , f dM' 



Si;a,8 

dn 



dz — V 

dz 



dn 



dlnMdlnM 
dM 



-2- 



(An) 
i 



M 



dr' 



dr' 



dM' dn 



dz^V 2 
dz 

dn 



M> dlnMdlnM' 



1 



M> 

dn 

(An) 

, (56) 



The difference between the terms associated with DD and 
DR is that in the former we have a product of two observed 
number densities, (dn/dlnM) x (dn/dlnM'), while in the 
latter we have a crossproduct between the observed and the 
mean number densities, (dn/dlnM) x (dn/dlnM'). 

As checked in App.lEl the mean of this second estimator 
ita 8 1S ec l ua l to the mean of the estimator £, ;Q ,/3 studied in 
Sect. KFH 

(£1?) = (57) 

To simplify the notations, in the following we do not con- 
sider binning over mass (i.e., we independently consider the 
correlation functions of halos above some mass thresholds), 
so that Eq. (j53l readily simplifies as 

1 



Q, 



>-r2 



d X V 2 fn 2 V,&\z) 



(58) 



and a similar simplification holds for Qi. If needed, it is not 
difficult to include a mass binning in the expressions given 
in the following. 



4.1.3. Comparison with simulations 

We compare in Fig. the mean correlation (|55| with re- 
sults from numerical simulations (which use the Landy & 
Szalay estimator) for halos above the thresholds M > 2 x 
10 13 and lO 14 /i _1 M , within the redshift range < z < 0.8. 
The error bars are the 3 — a statistical errors obtained from 
the covariance matrices derived in Sect. 14.2.21 for 34 fields 
of 50 deg 2 as used in the simulations. We obtain reason- 
able agreement with the simulations, although we appear 
to underestimate the halo correlation of the most massive 
halos at small radius, r < 7/i _1 Mpc. This may be due to 
a scale-dependent halo bias or to a small discrepancy in 
the definition of t he halo mass, whic h depends on the halo- 
finder algorithm ([Knebe et al.ll20lTh . 

4.2. Covariance matrices for the halo correlation 

We now consider the covariance of the estimators and 
£, LS . As described in App.[Dl the covariance of the Peebles 
& Hauser estimator is given by 



(59) 



with (see also lLandv fc Szal ay (1993) for a computation of 
low-order terms) 



r (2) - s- ■ 

°« " fltJ (An)Q 2 



, ^ 2 dn, dM f, dM,, 
dXiV; , - — — — / dr, ; 



(An) Mi 



Mi, 



dn dn r „ . , 
- -- -- 77. [* + &] 



dlnM dlnM,- 



(60) 



C 



(3) 



, _ 2 dn, dM, r dM,, 

dxi T>\ t-t^t -r^- / dr.// ■ 



(AQ)QiQj J " l (An) Mi J,"" Mr 

dMj, dn dn dn 

' Mj, din Mi dlnM,, din My 

x [l + €i,%> + + + Q,i',j'] i 



dr,- 



(61) 
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C 



(4). 
ij ' 



QiQj 

f a Xj T> 



dn t dM, 



/dr, 

(AQ) Mi Ji 1 M v din Mi din M* 



dMj/ dn 



dn 



C 



i2 dfij dMf 
(AO) Mj- 



dn 



dn 



Ij ry Mj, din Mj dlnMj 



(62) 



where £ h , £ h , and n h , are the two-point, three-point, and 
four-point correlation functions of the objects. To make 
the expressions compact but easy to understand, we in- 
troduced the following notation in Egs. (f60l) - ((62"]) . Variables 
associated with the object at the center of the Vi-shell arc 
noted by the label i (e.g., X i,Mi,..) and those associated 
with the object within the Vi-shell are noted by the label i' 
(e.g., Ti>, -Mj', ..). This corresponds to the primed and un- 
primed variables in Egs. (j4"8")) and (1521 . and we may speak 
of objects and j' . Then, in the indices of the cor- 

relation functions, we separate with a semicolon, as in 
of Eg. ([6"2"]) . objects i and j that are located at unrelated 
positions (xi, fij) and (Xjj^j) m the observational cone, 
whereas we separate with a comma, as in of Eq.(|rjQ")). 
objects that are located at a fixed distance r', (More pre- 
cisely, the distance r' is restricted to a radial bin V.) 

The label refers to quantities that involve n distinct 
objects. Thus, the contributions and arise from 
shot-noise effects (as is apparent through the prefactors 
l/(Af2)), associated with the discreteness of the number 
density distribution, and they would vanish for continuous 
distributions. However, they also involve the two-point and 
three-point correlations, and as such they couple discrete- 
ness effects with the underlying large-scale correlations of 
the population. In case of zero large-scale correlations, they 
remain nonzero because of the unit factors in the brackets 
and become purely shot-noise contributions, arising solely 
from discreteness effects. 

More precisely, contribution (|60l) arises from the cou- 
pled identification i = j and i' = j' (or i = j' and i' = j), 
whereas contribution (f6"Tj) arises from the single identifica- 
tion i = j (or either one of i — j' , %' = j, %' = j'). Thus, in 
Eq. foTj) the object i is at the center of both shells V, and 

Contribution is a pure sample-variance contribu- 
tion and does not depend on the discreteness of the number 
density distribution (hence there is no l/(Af2) prefactor). 

As shown in App. [El the covariance matrix of th e 
Landy fc Szalay estimato r reads as (see also lSzapudil (|200ll) ; 
iBernardeau et all l|2002h ) 



^LS _ n LS(2) ^LS(3) ^LS(4) 



where the first term is equal to Eg. lfuT))) . 



r LS(2) „(2) 



(63) 



(64) 



and 



C. 



LS(3) 



, _ 2 dfl, dM, f dM v 



(AtyQiQj J (Ail) Mi 

, . dMji dn dn dn 
x / dr,-/- 



M;i 



Mj< d In Mi d In M v d In Mj> 



x + Ci,i',j'] 



(65) 



,LS(4)_ 



1 



hj 



Qi Q j 



dft, dM, 



dr,v 



dM v dn 



dn 



(AO) Mi J, 1 Mi> dlnMj d In M* 



j d K V j 
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dMi 



(An) Mj 

+ ^iti'jj'. 



dr >' M 



dM,v dn 



dn 



j' 



d In Mj d In Mj 



(66) 



By comparison with Eas. ((6Tj) - (162|) we can see that 
many terms have been canceled (jLandv fc Szalavl 119931 : 
Szap udi fc Sza lay 1998). This confirms that the estimator 
((56)) is more efficient than (|48)l . since its covariance will be 
smaller. 



4.2.1. Low-order terms 

In this section we assume that the radial bins Ri,+] 
are restricted to large enough scales to neglect three and 
four-point correlation functions, as well as products such as 
£i.ji£i>.j- We compute these high-order terms in Sect. 14. 2 .121 
and Figs. II 51 and [TBI show the range where they can be ne- 
glected. Along the diagonal, for halos above 1O 14 /i _1 M0 
this corresponds to the full range 5 < r < lOO/i -1 Mpc. 
For lower mass halos, M > 2 x 1O 13 /i _1 M , all scales re- 
ceive significant contributions from high-order terms, but 
the low-order terms contribute to about 50% for r < 15/i _1 
Mpc. This is sufficient for our purposes in this section, 
which are to compare the Peebles & Hauser and the Landy 
& Szalay estimators, the shot-noise and sample-variance 
effects, and the scalings with survey area and number of 
subfields. Accurate computation of the covariance matrix 
requires taking all terms into account, which we do in 
Sect.SjH 

Thus, in this section we only keep the contributions 
that are constant or linear over the two-point correlation 
function £ h of the objects, and we again assume that the 
two-point correlation function can be factored as in Eq.((T|). 
Then, as shown in App. |Dl for the Peebles & Hauser esti- 
mator we obtain from Egs. (|6T))) - (f6"2")) . at this order, 



r, J ~ Si ' j (An)Q. 



■(! + &» 



d X V 2 rf VtV 3 



(AtyQiQj 



d X V 5 bVVjVi&yl, 



where we introduced 



(67) 



(68) 



Following the notation explained earlier, below Eg . (|f]2"|) . the 

Tr) 

comma and the primes in Q, '■, mean that this is a "spherical 
average" , more precisely the average over the two spherical 

shells Vi and Vy, in contrast to (£1, in Eq.©, which was a 
"conical" average within the observational cone. There are 
two indices, i' and j' , because we integrate over the two 

shells Vi and Vj, whereas in of Eg. (f54"|) there was only 
one index i' because we integrated over a single shell Vj. 
The Fourier-space expression of Eo. (|68p . which can be con- 
venient for numerical computations, is given in Eg. (|D.10l) 
in App. |Dj 
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Fig. 13. The covariance matrices Cj L | (solid line) and Cij 

(dashed line) of the estimators and £i, for i = 4 asso- 
ciated with the distance bin 12.3 < r < 16.6/i Mpc, as a 
function of j. We show the results obtained for halos in the 
redshift range < z < 0.8 with an angular window of 50 
deg 2 . Here we only consider the low-order terms given by 
Eqs.(E3 and ®. 



Again, to obtain Eq. (J57J) we neglected finite-size effects, 
that is, we did not take the fact into account that close 
to the boundaries of the survey part of the shell V; is not 
observed. As explained in App. [B] this only leads to an 
overestimate of 5% of the signal-to-noise ratio, for a radial 
bin of SO/i^Mpc in a circular survey window of 50deg 2 . 
This error decreases for wider surveys or smaller radial bins. 

For the Landy & Szalay estimator, at the same order 
the covariance matrix reads from Eqs. ([p^)) - (|rj5|) as 



°« ~ J (AQ)Qi 



(! + <&» 



x / d% V 2 b 2 n 3 ViVj ^ ' jr . 



,(r) 



(69) 



Again, as compared with Eq. (jrjT]) several terms have been 
canceled. Moreover, at this order only shot-noise terms, 
whether coupled to large-scale correlations or not, con- 
tribute to the Landy & Szalay covariance (|S§|) . as can be 
seen from the prefactors 1/(A£1). In contrast, at the same 
order in the Peebles & Hauser covariance (167)) , we have two 
more shot-noise terms (coupled to the large-scale correla- 



tions through the means £j, and Cf 1 ) an d one additional 
sample- variance-only contribution (i.e., the last term, with- 
out the prefactor l/(Af2)). 



Comparison of Peebles & Hauser and Landy & Szalay 
covariance matrices 

We show in Fig. 1131 one row of the covariance matrices 
Ci.j and Cj'j, as a function of j at fixed i. We consider 
halos in the redshift range < z < 0.8, for a window of 50 
deg 2 . The covariance is larger for the case of higher mass 
threshold. In agr eement with Eqs.(l67l) and ((69]) and with 
standard results (|Kerscher et al.ll2000t ). the covariance of 
the Landy & Szalay estimator (1551) is smaller than for the 



Fig. 14. The contributions C {2) and C (3) to the covariance 
of the Landy & Szalay estimator, along the diagonal i = j. 
As in Fig. 1131 we only consider the low-order terms, given 
by Eq. ([#]). 



Peebles & Hauser estimator (14"T1) . especially for the lower 
mass threshold (the higher mass threshold case being more 
dominated by the common shot- noise contribution (I64[) ). 

As shown by Fig. [T3l another advantage of the Landy & 
Szalay estimator is that its covariance matrix is much more 
diagonal than for the Peebles & Hauser estimator. This 
can be checked by comparing the left and middle panels 
of Fig. [Tj3 where we show the correlation matrices IZi.j 
defined as in Eg. (14"6"1) . but where we include all shot- noise 
and sample- variance contributions of Eas. ((67|) and (j69|) . 



Comparison of sample-variance and shot-noise effects 

We compare in Fig.[14]the contributions (first term 
in Ea. flMl) ') and (second term in Ea. ([rj9l ). again keep- 
ing only these low-order terms. We consider the same survey 
properties as in Fig. [T3] but plot these contributions along 
the diagonal, i — j. Let us recall that both contributions 
and are shot- noise contributions (i.e., they arise 
from the discreteness of the halo distribution). However, 
they also involve the underlying large-scale correlations, as 
apparent through the factors £. In particular, , which 
arises from a single pair identification, vanishes if there are 
no large-scale correlations, whereas C^- 2 \ which arises from 
two pair identifications, remains nonzero if £ = (the term 
associated with the factor 1 is thus a "pure shot-noise" con- 
tribution). Therefore, by comparing and we can 
assess the relative importance of shot-noise and sample- 
variance effects, C^ 2 ) involving an extra degree of shot noise 
(one more pair identification). As expected, is dom- 
inant for small distance bins, which correspond to small 
volumes, V{ oc r 3 , and contain few halos. It also remains 
dominant up to larger scales in the case of more massive 
halos, which are rarer. Since the contribution C*- 2 -* is diag- 
onal, as shown by the Kronecker prefactor in Eq. (|r77|) . it 
implies that covariance matrices are more strongly diago- 
nal for high-mass halos, as can be checked in Fig. [19] where 
we show the correlation matrices TZi.j of small (upper row) 
and large (lower row) halos. 
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Scalings with survey area and number of subfields 



As in Sect. 13.2.11 we consider the dependence of the 
signal-to-noise ratio on the total survey area An and on 
the number Af of subfields. Thus, we define the estimator 
£LS,tot ag ^ e mean Q f tne estimators |^ S '^ Q ^ of Eg. (156)) of 
the subfields, 



£LS,tot 



1 N - 

= 77 ^ 



LS,(a) 



(70) 



a = l 



Of course, the expectation value is independent of (Ail) 
and Af, 



( |LS,to t) = (e 



LS,(a)v 



is independent of (Ail) and Af. (71) 



From Eg. ([50)) we can check that does not depend on 
(An) nor Af, so that for each subfield a, of area (An) /Af, 
the covariance ([6"9")l scales as 



(An)' 



(72) 



Both terms in Ea.(|M| scale in the same fashion, so that 
the structure of the covariance matrix does not change with 
(An) nor Af (i.e. it does not become more or less diagonal), 
if we neglect boundary effects. Then, the covariance matrix 
of the averaged estimator ([TO")) scales as 



C 



LS.tot 



1 



-.c 



LS,(a) 



1 



oc 



Af^ iJ "(An)' 
so that the signal-to-noise ratio scales as 



S_ 

N 



LS,tot\ 



c 



LS.tot 
i,3 



cx a/ (An). 



(73) 



(74) 



Therefore, a single wide- field survey and a combination of 
several independent smaller surveys, with the same total 
area, show the same efficiency. This is because both terms 
in Eq.(|69p scale in the same way with the survey geome- 
try, as l/(An), because the sample- variance effects involved 
in these mixed contributions arise from the correlation be- 
tween objects separated by a distance r < Ri,+ + Rj,+ , 
independently of the angular size of the survey. This is dif- 
ferent from the sample- variance contribution (|27[) to the 
covariance of the number counts, which explicitly depends 
on the large-scale correlation over the survey angular size 
9 S , see Eq. (fT3"|) . because it arises from the correlation be- 
tween objects located at any position in the survey cone. 
Of course, result (l74|) only applies to small length scales, 
+ Rj t+ <C T>9 S , where it is legitimate to neglect finite- 
size effects. For long wavelengths a wider survey is clearly 
more efficient, and the only possible choice for scales that 
are close to the larger survey diameter. 

4.2.2. High-order terms for the covariance of £ LS 

We now estimate the high-order terms for the covariance 
Cjj of the Landy & Szalay estimator £f s that we ne- 
glected in Ea.(|69p. where we only kept terms of order zero 
or one over the two-point correlation function. To evalu- 
ate the contributions associated with the factors C h ,, ,v in 
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Fig. 15. The low- and high-order contributions to the co- 
variance matrix Cfj along its diagonal. We again consider 
halos in the redshift range < z < 0.8, with an angular 
window of 50 deg 2 , above two mass thresholds. 



Eg. (l6"51 and rjfy.jj, in Eg. ([Ml), we use the model for the 
three- and four-point halo correlation functions described 
in Sect. 12.1.21 Then, as shown in App. [F) the contribution 
associated with the product i n Eg. ([SB)) is given by 



C 



LS(«) 
i,3 



QiQj 

the term C]V,i' of Ea. ((55|l yields 



(75) 



C LS(C) = 4 

lJ (An)Q l g J 



d X P 2 &Vv^ 



^>i' A^i' A' ^j'.i^i', i' 



(76) 



and the term vti'-j j> °f Ea. ([6"6")) gives 



C 



hS( V ) ^ 
i,3 



lA r ) v C _I_ O C ( r ) v Ar) V f 



i 9c( r )t. .A r ) i A r ) A r )A r ) , A r ) A r ) A r ) 

^j'',i^'i3^j;i' ' ^j' ;z>z,z' ' ;j ^i' ^j' ',i 



(77) 



where the various factors are given in App. [T] and we used 
for Eqs. ([75|) - ([77|) the "hierarchical clustering ansatz", de- 
scribed in Figs. [TJ and [2] and given by Eqs.([4]) and ([6]). 

The terms C^j® and C 4 L |^' > are "pure sample- 
variance" contributions. Thus, there is no prefactor l/(An) 
and they involve large-scale correlations among four ha- 
los, i,i',j,j'. The term Cfj® is a coupled shot-noise and 
sample-variance contribution, as shown by the prefactor 
l/(An) and the fact that it involves large-scale correla- 
tions among three halos, i,i',j'. (The discreteness of the 
halo distribution has led to the identification i = j, i.e. a 
shot-noise effect, which leaves three distinct halos.) 

We compare in Figs. [TBI and [TBI the low-order contribu- 
tions (|69|) with these high-order contributions d75l) - ([77l) . We 
can see that the latter can be non-negligible on these scales, 
5 < r < 100/i _1 Mpc. Along the diagonal, i = j, shown in 
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Fig. 16. The low- and high-order contributions to the co- 
variance matrix Cjj, as in Fig. [15] but along one row. This 
corresponds to the fixed bin i = 4, associated with the dis- 
tance bin 12.3 < r < 16.6ft. _1 Mpc, as a function of j. 



Fig. 1151 they are always significantly smaller than the low- 
order contribution (which includes both sample-variance 
and shot-noise effects) for massive halos, M > 1O 14 /i _1 M0, 
but are close to it or larger for M > 2 x 10 13 ft. _1 Af Q . On 
large scales the main high-order contribution is the term 
([75]) . associated with a product while the terms ([76]) 
and (177)) , associated with the three- and four-point correla- 
tion functions, dominate on small scales. Indeed, the former 
does not increase much on small scales, whereas the latter 
are very sensitive to the smoothing scales Rt and Rj and 
show a steep growth on small scales, even though formally 
C is also of order within the model ((4]). This is because 
the term (|75|) involves the product of two correlations be- 
tween two distinct lines of sight, as seen in Eg . (jStj]) . so that 
each £ is averaged along the radial direction, while the term 
(|76p , which arises from one shot-noise contraction that has 
removed one line-of-sight integration, involves the product 
of two correlations between a central point and two points 



at distances Ri and Rj, as seen in Eq. (|65j) . 

As seen in Fig. [TH1 at fixed i the relative importance of 
these high-order contributions to Cfj increases as the bin 
j shifts to smaller scales. Again, we can see that among 
these contributions the term (|75|) dominates on large 
scales and saturates on small scales, while the and "77" 
terms ([76]) and ([77]) dominate on small scales and strongly 
depend on the smoothing scales. 

That high-order contributions can become dominant 
as one of the bins i and j shifts to small scales agrees 
with expectations, as one probes deeper into the nonlin- 
ear regime where three- and four-point correlation func- 
tion s become important, and with some previous stud- 
ies ([Meiksin fc Whitdll999t IScoccimarro et al.lll999t ). This 
implies that the covariance matrix is less diagonal once 
we take these contributions into account, and it decreases 
the number of effectively independent modes. This can be 
checked in Fig. 1191 where we show the correlation ma- 
trices TZf^ without (middle panels) and with (right pan- 
els) these high-order contributions, for the mass thresholds 
M > 2 x W 14 h- 1 M Q and M > 1O u /i- x M . Therefore, 
for survey characteristics such as those of Figs. [T51ITB1 it 



Fig. 17. The covariance matrix C} 1 ^ of the Landy & Szalay 
estimator, along the diagonal i = j. We show our analyt- 
ical results including all contributions (solid lines) or only 
low-order terms (dotted lines), and results from numerical 
simulations (dashed lines). 
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Fig. 18. The covariance matrix C Z L |, as in Fig.[17l but along 
one row. This corresponds to the fixed bin i = 4, associated 
with the distance bin 12.3 < r < 16.6/i _1 Mpc, as a function 
of j. 



is necessary to include high-order contributions to the co- 
variance matrix of two-point estimators for moderate-mass 
halos that are not dominated by shot-noise effects. 

4.2.3. Comparison with numerical simulations 

We display in Fig. [17] the covariance matrix Cj L | of the 
Landy & Szalay estimator, along its diagonal. We show 
our results obtained when we include the high-order con- 
tributions of Sect. EL3 see Eas.([75 ]) -([77 ]l . and when we 
only take the low-order terms of Eq. fMl) into account. We 
obtain a good match to the numerical simulations, espe- 
cially on the largest scales, which are also more reliable. 
In particular, we recover the strong dependence on radius 
and halo mass. We can see that, for moderate-mass halos, 
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M > 2 X 10 13 ft- _1 Mq, the high-order contributions are not 
negligible (because the low-order shot-noise contribution is 
relatively smaller). 

We show the same covariance matrix along its fourth 
row in Fig. 1181 The results from the numerical simula- 
tions are somewhat noisy, especially for the rare massive 
halos at low radii. However, where they are reliable they 
show reasonably good agreement with our analytical re- 
sults. In agreement with Sect. 14.2.21 it is clear that, even 
more than along the diagonal, the high-order contributions 
of Eqs. ([75)) - (|77l) cannot be neglected in order to obtain a 
good estimate of the off-diagonal terms of the covariance 
matrix (see also Fig. IT9")) . 



4.2.4. Correlation matrices 

We show in Fig. [19] the correlation matrices, denned as in 
Eq. (|4"o]) but for the full covariance matrix dj of the halo 
correlation. (Although IZij is a discrete 10 x 10 matrix, it is 
still possible to draw a contour plot by interpolation. This 
gives clear figures that are easier to read than a density plot 
where each cell is colored with a level of gray that depends 
on the entry IZij.) 

The left and middle panels of Fig. [T5] clearly show the 
strong improvement associated with the use of the Landy 
& Szalay estimator in place of the Peebles & Hauser es- 
timator. In agreement with Fig. [T?] and the discussion in 
Sect. l4.2TTl the correlation matrix is more diagonal for mas- 
sive halos, where the diagonal shot-noise contribution 
of Ea.foTfl) is more important. Indeed, shot-noise effects be- 
come dominant for rare objects. For the same reason, high- 
order contributions to the covariance matrix, which are due 
to sample- variance effects, are more important for low-mass 
halos, as shown by the comparison between the middle and 
right panels. The slope of the contour lines in the right pan- 
els, especially in the low-mass case, shows that high-order 
terms are more important on small scales and also increase 
the correlation between small and large scales while making 
the matrix less diagonal. 

Thus, for low-mass halos there are rather strong corre- 
lations between all scales in the range 5 < r < 100/i~ 1 Mpc, 
and to obtain accurate estimates of error bars on cosmolog- 
ical parameters it is necessary to take off-diagonal entries 
and high-order contributions to the covariance matrix into 
account. 



5. Angular correlation function 

In the previous section we considered the real-space 3D 
correlation function, which requires knowledge of the ra- 
dial position of the halos (or more generally of the objects 
of interest). If this information is not available (e.g., red- 
shift estimates are too noisy or distance measures are highly 
contaminated by redshift-space distortions), it is still pos- 
sible to derive some constraints on cosmology f rom the 
angu l ar distribution of the objects on the sky ([Peebles! 
1980; lEisenstein fc Zaldarriagal " l200lt IMaller et all 120051) . 
Therefore, in this section we apply the formalism developed 
in Sect|3]to the angular two-point correlation function w(9). 



5.1. Mean correlation 

5.1.1. Peebles & Hauser estimator 



As in Sect. 13.2.21 we write the observed number density 
of objects on the sky as N(fl), but we omit the index i of 
Eq. (|4"0")) since we consider a single redshift bin. As in Sect. 01 
the width Az is not necessarily small and may cover the 
whole redshift range of the survey. Then, using notations 
that are similar to Eq. (|4"8"l) . we can write the Peebles & 
Hauser estimator w, as 



1 + Wi = 

with 



l 



dO 

(AO) 



dO 

(An) 



AT(n) 



N(Sl) / ' dd'N(n'), 



de'jv(n'), 



(78) 



(79) 



where N(£l) is the mean angular number density on the 
direction O, given by 



N = I dxV 2 n{z) = I dxT> 



dM dn 



M dlnM 



(M,z). (80) 



Here we used Eq. (fT8|) . and we assumed that the sky cov- 
erage is the same over the survey window (Af2), so that 
N(Sl) is actually a constant that does not depend on ft 
(but the formalism is readily extended to the more general 
case where we add a filter that depends on ft). 

Here and in the following, the index i of Eqs. (j78)) - (l79l) 
refers to the angular bin [9i t -,9i_ + ], over which we esti- 
mate the angular correlation w(0). Again, we denote with 
unprimed letters the quantities associated with the first ob- 
ject, such as its position ft on the sky, and with primed let- 
ters the quantities associated with the neighbor at distance 
9' , such as its position f2' . We use the flat-sky and Limber's 
approximations, which are typically valid for angular radii 
below 10 deg, as seen in Fig. [TU] 

The quantity Qi introduced in Eo. ([79[) can be written 

as 



Qi = Ai N , 



(81) 



using that N defined in Eq. (l80l) does not depend on f2 in 
our case, and Ai is the area of the J-ring, 



Ai = 7T( 



-)• 



(82) 



Then, we proceed as in Sect. 0] Substituting the observed 
3D number density dh/dhiM as in Eq. (|17l) . introducing 
the halo two-point correlation £ h when we take the average 
as in Eq. (|5*2"]) . and using the factorization (fT]), we obtain 



N 
with 



(83) 



(84) 



The superscript "(0)" recalls that Eo. (|54")l is an average over 
the angular ring Ai, instead of the 3D spherical shell V,: of 
Eq. (l5"4"l) . The prime in the subscript "i"' also recalls that 
we integrate over a neighboring point 9' , with respect to a 
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Peebles & Hauser (low order) Landy & Szalay (low order) Landy & Szalay (all orders) 




0.8 1.0 1.2 1.4 1.6 1.8 2.0 0.8 1.0 1.2 1.4 1.6 1.8 2.0 0.8 1.0 1.2 1.4 1.6 1.8 2.0 

Log(R) [Mpc/h] Log(R) [Mpc/h] Log(R) [Mpc/h] 

Peebles & Hauser (low order) Landy & Szalay (low order) Landy & Szalay (all orders) 




0.8 1.0 1.2 1.4 1.6 1.8 2.0 0.8 1.0 1.2 1.4 1.6 1.8 2.0 0.8 1.0 1.2 1.4 1.6 1.8 2.0 

Log(R) [Mpc/h] Log(R) [Mpc/h] Log(R) [Mpc/h] 



Fig. 19. Contour plots for the correlation matrix Hij, defined as in Ea. p5|) but for the full covariance matrix Cy of the 
halo correlation. There are ten distance bins, over 5 < r < 100/i _1 Mpc, equally spaced in log(r), as in previous figures. 
We consider halos in the redshift range < z < 0.8, within an angular window of 50 deg 2 , above the mass thresholds 
M > 2 x 10 13 h~ 1 M & in the upper row, and M > 10 14 /i _1 Af© in the lower row. Left panels: Low-order contributions 
((67|) for the Peebles & Hauser estimator. Middle panels: Low-order contributions ([69| for the Landy & Szalay estimator. 
Right panels: Full correlation matrix, including the high-order contributions of Eqs. ([75l) - ([77|) . for the Landy & Szalay 
estimator. 



given point (x, 2?fi) of the observational cone. However, be- 
cause the two points are only close in the 2D angular space 
(i.e., in the i-ring), we also integrate over the longitudinal 
coordinate x' along the full line of sight in Eq. ([M|) . 

Explicit expressions for (z) are given in App. [Ul In 
contrast to the number counts studied in Sect. |31 where, 
for large angles above a few degrees, it is necessary to 
go beyond Limber's approximation, as found in Figs. [5] 
and 1101 for our study of the angular correlation function 
Limber's approximation is sufficient because we consider 
much smaller angular scales of a few arcmin. 

5.1.2. Landy & Szalay estimator 

As in Sect. 14.1.21 the measure of the angular correla- 
tion can be made more accurate by using the Landy & 
Szala y estimator instead of the Peebles fc Hauser est imator 
dZHD (La ndv fc Szalavl[T99l [S^apudi fc Szaravlll998lh As in 
Eq. ([5t]|) . this reads as 

9 f HO ~ f 

-Qj(An) m) J dem ' ) + 1 ' (85) 



and we can check that its mean is again equal to the average 

(83). 



5.1.3. Comparison with simulations 

We compare in Fig. [20] the mean correlation (l83| with re- 
sults from numerical simulations. The error bars are the 
3 — (7 statistical errors obtained from the covariance matri- 
ces derived in Sect. I5.2.2l for 41 fields of 400 deg 2 as used in 
the simulations. Above 5 arcmin we obtain a good match 
between our results and the numerical simulations. This 
could be expected from Sect. [4] since the angular correla- 
tion is a projection of the 3D correlation. On lower angular 
scales the discrepancy may be due to the finite size of the 
clusters. This implies that £ h = — 1 at distances below the 
sum of the two cluster radii (exclusion effect), but we have 
not included this effect in our bias model (JlJ . Since a typi- 
cal cluster at z = 0.5 (with a size of lh^ 1 Mpc) corresponds 
to an angle of ~ 2.5 arcmin and projection effects are rare 
(since clusters are rare objects with a surface density ~ 10 
deg -2 ), this exclusion effect indeed occurs at 9 5 arcmin 
and appears at slightly larger angles for more massive halos. 
This explains the behavior found on these scales in Fig. [501 
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Fig. 20. The mean angular correlation, (u>^ s ), over eight 
angular bins within 1.25 < 9 < 50 arcmin, equally spaced 
in log(6*). We compare our analytical results (solid lines) 
with numerical simulations (dashed lines). 

5.2. Covariance matrices for the halo angular correlation 

The covariance matrices of the estimators Wi and wf s can 
be computed following the procedure used in Sect.|4]for the 
3D correlation. Denoting again the covariance matrices as 
Ci.j and CfJ, they decompose as in Eg. (|59"|) . 



(86) 



where is a pure sample- variance contribution, whereas 

(2) (3) 

CH and cn are shot-noise contributions that arise when 
either one pair or two pairs of objects are identified. Again, 

(2) (3) 

the contributions C) „ and C) ' also involve the two-point 
and three-point correlations; i.e., they contain terms that 
couple discreteness effects with large-scale density correla- 
tions. 

For the Peebles & Hauser estimator (|78p we obtain, as 
in Eqs.(|6T) )) - (p2")) . 



r (2) - s 

°« (Afi)Q? J (An) 



da 2 dMi r 2 dM,, 



Mi 



For the Landy & Szalay estimator (155"j) only a few of 
these terms remain, as in Eas. ([64"]) - (l66l) . and we obtain 

r LS(2) _ r ,(2) 



c 



LS(3) 



(90) 

da 2 dM /■ „ 2 dA^ 



Mi 



M,, 



2 dMj' dn dn dn 



X J de ? d Xj' V j' M ., din Mi din M v d\nM r 
xfe+C^v], (91) 



LS(4) 



i /• da 



dM ij*>j ,^n d K dn 



2,2, y (An) 
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1 Al 1 A/f' 
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dn /" dJl 



dM ^ fl ',./^2 dM i dn 



-dY.-X^— idfl d X Z> 



dln.\/,' ./ (AO) AJ J ' Mj j Aj j Mj dlnMj 
dn 

dln.\/' - 2 ^' J ' 



(92) 



see afso lSzapu di (2001), and lBernsteinl (|1994f ) who considers 
(up to order n -2 over the inverse of the mean density) the 
additional terms associated with fluctuations of the denom- 
inator in the estimator (1551) . when the latter is normalized 
to the number counts in the same field. 

5.2.1. Low-order terms 

Keeping only the contributions that are constant or linear 
over the two-point halo correlation £ h , as in Eas. (j67l) and 
(1591 . we obtain 



( ' ,J ~ Sid <A<>;0, 



x / dxT> 5 b n 2 



AO) ,AS) A9) 



;t2. 



and 



1?) 



dn 



dlnM dlnM 



7 [i + €m'] 



M^ 
(87) 



C 



LS 



+ =2 I d X V b b n^cyl, 



2[1 + (tBi)] 



(93) 



(Afi)Qj (Afi)iV 3 



d x 2? 5 rn 2 4 ( fj, (94) 



where, in a fashion similar to Eq.(|68p. we introduced the 
average 



C 



(3) 



(AO)fiiQ J - J (Afi) 



M; 



M f ; 



d0i 



^ 2 dMj> dn dn 



dn 
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V 
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xy Mfdxj'Vj, M ^ dlnM . dlnM ., dlnMjl 
x [i+^V + ^V + ^-' + cW] > 

1 



r (4) = 

4J QiQj ./ (An) 



da, ^dMi 2 dA£, dn 
dx * V > l^'^'l^dlnM 



dn 



dn, 



, dM, 



2 dMy dn 



dlnM: J^^ n ^ M ^-' p y^dA^ 



dn 



(89) 



The Fourier-space expression of Eq. (|95[) is given in 
Eq.(|H3|). 



Comparison of Peebles & Hauser and Landy &i Szalay 
covariance matrices 

We compare in Fig. [2T]the covariances matrices (|93|) and 
as a function of j at fixed i. As in Fig. [501 we consider 
halos in the redshift range < z < 0.8 in a survey of area 
400 deg 2 . As was the case for the 3D real-space correlation 
f shown in Fig. I13[ a nd in agreement with previous works 
((Kerscher et al.ll2000h . the covariance is much smaller and 
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Fig. 21. The covariance matrices Cfj (solid line) and Cij 

(dashed line) of the estimators wf s and Wi , for i = 2 associ- 
ated with the angular bin 2 < 9 < 3.2 arcmin, as a function 
of j. We show the results obtained for halos in the redshift 
range < z < 0.8, with an angular window of 400 deg 2 , 
above the mass thresholds M* = 2 x 10 13 and 1O 14 /i _1 M , 
from bottom to top. Here we only consider the low-order 
contributions, given by Eqs. fMf and 
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Fig. 22. The contributions C (2) and C (3) to the covariance 
of the Landy & Szalay estimator, along the diagonal i = j. 
As in Fig. 1211 we only consider the low-order terms, given 
byEq.(|<H|). 



more diagonal for the Landy & Szalay estimator (I55t than 
for the Peebles & Hauser estimator (jTB")) . This can also be 
clearly seen from the comparison of the left and middle 
panels of Fig. [571 where we show the correlation matrices 
associated with Eqs.(|n31) and (IM)) . 



Fig. 23. The low- and high-order contributions to the co- 
variance matrix along its diagonal. We again consider 
halos in the redshift range < z < 0.8, with an angular 
window of 400 deg 2 , above two mass thresholds. 
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Fig. 24. The low- and high-order contributions to the co- 
variance matrix Cf^ , as in Fig. [531 but along one row. This 
corresponds to the fixed bin i — 2, associated with the an- 
gular bin 2 < 9 < 3.2 arcmin, as a function of j. 



the Landy & Szalay estimator. As compared with C^ 3 \ 
C^ 2 -* involves an extra degree of shot noise (one more pair 
identification). Taking only these low-order terms into ac- 
count, the covariance is dominated by (whence shot- 
noise effects are dominant) below 10 arcmin for halos above 
M* = 2 x 10 13 /i _1 M Q , and below 50 arcmin for halos above 
M* = 10 14 /i _1 Af Q . As for the 3D correlation, shot -noise ef- 
fects dominate up to larger scales for more massive and rare 
halos, and this also implies that their covariance matrix is 
more strongly diagonal. 



Comparison of sample-variance and shot-noise effects 

Next, we compare in Fig.[52]the contributions C (2) (first 
term in Eq. (|M]) ) and C^ 3 ' (second term in Eq. flM]) ). again 
keeping only these low-order terms of the covariance of 



5.2.2. High-order terms 

At small angular separations, the high-order terms in 
Eqs. fOTl) and (|52"]l . associated with the product and the 
three- and four-point correlations C and 77, are not negligi- 
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ble. As in Sect. 14.2.21 and in iBernsteinl (|l994h . to estimate 
these high-order correlations we use the "hierarchical clus- 
tering ansatz" shown in Figs. Q] and [3] and given by Eqs.Q- 
Q. As described in App. [Hj we follow the procedure that 
we have already used in App. [F] to compute the high-order 
terms associated with the 3D correlation £. Then, the con- 
tribution associated with the product ^f.^^f/.j in Ea. (|9"2"|) 
writes as 



= ^ [ 2 dyyA^(y)Bl 2 \ye s )Bf\y9 s ), 
N Jo 



if 2 

rr Jo 

the term ,, of Eq.flU]) yields 
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and the term rjf^. ■ of Eg, ([§2"]) gives 
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where the various factors are given in App. iHl 

We compare in Figs. [23] and [24] these high-order con- 
tributions (lM]) - (r9"8l with the low-order contribution (jMJ), 
for the covariance matrix of the Landy & Szalay estimator. 
We recover the qualitative behavior encountered in Figs. [15] 
and[T5]for the estimator of the 3D correlation function. The 
"C" and "77" terms ([9T]) and (|9"5]l show a strong dependence 
on the smoothing scales, while the term (|M|) shows a 
very weak dependence. Again, this is because the contribu- 
tion (|96p involves the product of two correlations between 
two distinct lines of sight, so that each £ is averaged over 
the angular window 8 S of the survey, as seen in Eg . (T9"2"1) . 
whereas the contribution (r3"71) involves the product of two 
correlations between a central point and two points at an- 
gular distances 9i and 9j, as seen in Eq. (|9"Tj) . 

For the case of massive halos, M > 10 14 /i _1 Mq, the 
high-order terms are negligible along the diagonal, which 
is dominated by the shot-noise term, and only give a mod- 
est contribution to off-diagonal entries. Then, the covari- 
ance matrix remains strongly diagonal (for the angular bins 
studied here). 

For the case of low-mass halos, M > 2 x 10 13 /i _1 M Q , 
the high-order contribution (196[) to the diagonal is no longer 
negligible for 9 > 5 arcmin, while the two other contribu- 
tions (p?Tj) and ([§5]) . which involve the three- and four-point 
correlation functions, are always subdominant on these 
scales. This is a convenient property since the modeliza- 
tion of high-order many-body correlations is increasingly 
difficult. However, this was not the case for the 3D corre- 
lation £, as seen in Figs. H"5l and [TBI except on large scales. 
For off-diagonal entries, the high-order contribution (j9"6"]) 
can become dominant for widely separated angular scales, 
while on small scales, 9 ~ 1 arcmin, all contributions are of 
the same order of magnitude. 
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Fig. 25. The covariance matrix Cj L | along its diagonal. We 
show our analytical results including all contributions (solid 
lines) or only low-order terms (dotted lines), and results 
from numerical simulations (dashed lines). 
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Fig. 26. The covariance matrix C Z L |, as in Fig. [55] but along 
one row. This corresponds to the fixed bin i = 4, associated 
with the angular bin 5 < 9 < 8 arcmin, as a function of j. 



5.2.3. Comparison with numerical simulations 

As for the 3D correlation, we show the covariance matrix 
Cj L |, along its diagonal and along one row, in Figs. |2"51 and 
m a gain we obtain a reasonable agreement with the nu- 
merical simulations. For moderate-mass halos, the high- 
order contributions are again necessary to obtain a good 
match on large scales for diagonal entries and on most scales 
for off-diagonal entries. The off-diagonal terms of the co- 
variance matrix obtained from the numerical simulations 
are rather noisy, and our analytical results are competitive 
in obtaining reliable estimates. 



5.2.4. Correlation matrices 

We show in Fig. [57] the correlation matrices TZij, defined 
as in Ea.(|4"6"]). but for the full covariance matrices Cij of 
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Fig. 27. Contour plots for the correlation matrix TZij, defined as in Eq. pSf but for the full covariance matrix Cy of 
the halo angular correlation. There are eight angular bins, over 1.25 < r < 50arcmin, equally spaced in log(#), as in 
previous figures. We consider halos in the redshift range < z < 0.8, with an angular window of 400 deg 2 , above the 
mass thresholds M > 2 X lO^/i^M© in the upper row, and M > 10 14 /i _1 M© in the lower row. Left panels: Low-order 
contributions (l93l) for the Peebles & Hauser estimator. Middle panels: Low-order contributions (|94p for the Landy & 
Szalay estimator. Right panels: Full correlation matrix, including the high-order contributions of Eqs. (|96[) - (I98I) . for the 
Landy & Szalay estimator. 



the estimators of the halo angular correlation. As for the 
3D correlation, we can check that, keeping only low-order 
terms, the correlation matrix of the Landy & Szalay es- 
timator (|8"5|) is much more diagonal than for the Peebles 
& Hauser estimator ([78)1 . Taking high-order contributions 
into account makes the matrix slightly less diagonal, but 
it still remains significantly diagonal, in agreement with 
Fig. [Ml As in the 3D case, the correlation matrix is much 
more diagonal for massive halos, where shot-noise effects 
are more important. The full angular correlation matrix is 
more diagonal than its 3D counterpart shown in the right 
hand panels of Fig. [TH1 and the correlations between small 
and large angular scales are not as strong as the correla- 
tions between the small and large radii found in Fig. 1191 In 
particular, off-diagonal entries and high-order contributions 
play a less important role (although for low-mass halos it 
is still useful to take them into account). 

6. Applications to real survey cases 

In this section, we compare the statistical significance of 
the number counts and of the 3D correlation function for 
future large cosmological cluster surveys (we give in App. [J] 
the selection functions that we use for some of these sur- 



veys). Here we must note that, while redshift-space distor- 
tions only have a low impact on angular number densities 
(number counts and angular correlations) for wide redshift 
bins, they can more strongly affect 3D clustering. In prin- 
ciple, redshift distortions could be corrected to recover a 
real-space map if the velocity field is known, and when also 
applying a finger-of-god compression, but this would re- 
quire a rather complete spectroscopic follow up, so it is 
not very practical. Therefore, observations instead provide 
redshift-space 3D correlations. Then, the results discussed 
in this section for 3D correlations should be seen as a first 
step toward more accurate computations. 

Nevertheless, a simple estimate shows that these 
redshift-space distortions should not strongly affect our re- 
sults. Indeed, we find in the numerical simulations that 
at z — 0.5 for instance clusters have peculiar velocities v 
on the order of 300 km/s along each axis. The redshift- 
space coordinate sii along the line of sight is given by 
S|| = x\\ + v\\/(aH). This yields a typical error Ax\\ for 
the cluster comoving coordinate on the order of 3.6/i _1 
Mpc. This is not much larger than the typical size of the 
clusters, which ranges from 1 to 2/i _1 Mpc. Then, for dis- 
tance bins that are larger than 20/i _1 Mpc we can expect 
redshift distortions to affect our results on the covariance 
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matrices by about 20%. The net effect should actually be 
smaller because the 3D estimators also include information 
on clustering along the transverse directions, which are not 
contaminated by the cluster peculiar velocities. We leave 
an explicit computation of these redshift-space distortions 
to future works. 



6.1. Surveys of limited areas 
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Fig. 29. The mean correlation, from Ea.(|58p. over ten 
comoving distance bins within 5 < r < lOOft^Mpc, equally 
spaced in log(r). We integrate over halos within the redshift 
interval < z < 1, for the XXL, DES, and SPT surveys, as 
in Fig. [28] (again the error bars for SPT are slightly larger 
and shifted to the right with respect to those of DES, for 
M > 5 x 10 /i _1 Mq). The error bars show the diagonal 

part of the covariance, 



<Cff, for the Landy & Szalay es- 
timator, from Eqs.flBH]) and ([75 ]) -(|77 ]> , 



Fig. 28. The mean angular number densities of X-ray clus- 
ters per square degree, within redshift bins of width Az — 
0.1, for the XXL, DES, and SPT surveys. Error bars con- 
tain both the shot-noise and sample- variance contributions, 
from Eas. (l22l) and (j27|) . For DES we consider the mass 
thresholds M > 5 X 1O 13 /i _1 M and M > 5 x lO 14 ^ 1 !/© 
(smaller error bars), and for SPT the mass threshold M > 
5 x 1O 14 /i _1 M (larger error bars shifted to the right). 

We first consider several surveys of clusters of galaxies 
on limited angular windows. 

- The XXL survey ([Pierre et al.ll201lj) is an XMM Very 
Large Programme specifically designed to constrain the 
equation of state of the dark energy by using clusters of 
galaxies. It consists of two 5x5 deg 2 areas and probes mas- 
sive clusters out to a redshift of ~ 2. The well-characterized 
cluster selection function relies on the fact that clusters 
of galaxies are the only extended extragalactic sources, so 
that the selection operates in a two-dimensional param- 
eter space (equivalent to flux and spatial extent), allow- 
ing for different degrees of contamination by misclassified 
point sources. We show the mass detection probabilities as 
a function of redshift in the left hand panel of Fig. I J. 11 for 
the CI selection. The space density of this population is 
~ 6deg -2 . This complex selection function F(M, z), which 
differs from a simple mass or X-ray flux threshold (see also 
iPacaud et all ([20061 . 120071 )1 is readily included in our for- 
malism through a redefinition of the halo mass function, 
n(M, z) -> F(M, z)n(M, z). 

- The Dark Energy Survey (DES) is an optical imaging 
survey to cover 5,000 deg 2 with the Blanco 4-meter tele- 
scope at the Cerro Tololo Inter- American ObservatorjO- We 



https: / /www. darkenergysurvey.org/index.shtml 



consider the expected mass threshold M > 5 x 10 13 H~ 1 Mq, 
as well as the subset of massive clusters M > 5 x 
10 14 /i _1 AfQ, since a binning over mass should help in de- 
riving tighter constraints on cosmology. 

- The South Pole Telescope (SPT) operates at mil- 
limeter wavelength^. It will cover some 2, 500 deg 2 at 
three frequencies, aiming at detecting clusters of galaxies 
from the Sunyaev-Zel'dovich (S-Z) effect. A preliminary 
survey of 178 deg 2 at 150 GHz reveals some 20 clusters 
down to a depth of 18 [iK. Extensive simulations allow 
the determination of the mass completeness level, above 
a given significance for t hese secondary CMB anisotropics 
([Vanderlinde et al.ll2010D . This gives a mass threshold on 
the order of 5 x \^hr x M Q . 

We show in Figs. [2H [Ml and [301 the angular number 
densities and 3D correlations expected for these various 
surveys. The error bars include all shot-noise and sample- 
variance contributions (including high-order terms). For the 
higher redshift interval, 1 < z < 2, we only show the corre- 
lation of clusters above 5 x 10 13 /i _1 A/ Q for DES, because 
in other cases the error bars are too large to allow an accu- 
rate measure. On the other hand, to take advantage of the 
good expected accuracy of this case we consider in Fig. [30] 
distance bins that are half the size of those of Fig. [25] 

As expected, the DES provides the best measures of 
cluster number counts and correlations, hence the tightest 
constraints on cosmology, thanks to its wide size, which 
provides a large number of objects. However, the much 
smaller XXL survey already provides a meaningful mea- 
sure of both the abundance and the correlation of clusters, 
and appears to be a promising tool. The SPT survey al- 
lows a useful measure of the number counts as a function 

7 http://pole.uchicago.edu/ 
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Fig. 30. The mean correlation, for the clusters de- 
tected by DES over the redshift interval 1 < z < 2. Here 
we consider 20 distance bins within 5 < r < 100/i~ 1 Mpc, 
equally spaced in log(r) (i.e. twice as many as in Fig. 1291) . 

of redshift, but its rather high mass threshold leads to a rel- 
atively small number of objects, hence large error bars for 
the 3D correlations, even though a positive signal should 
still be within reach. Assuming its expected mass threshold 
of M > 5 x 10 13 /i _1 M Q remains valid over 1 < z < 2, the 
DES is the only survey among these three that allows an 
accurate measure of the cluster correlation at high redshift, 
which should help to further constrain the cosmology. 

6.2. All-sky surveys 
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Fig. 31. The mean angular number densities of clusters 
within redshift bins of width Az = 0.1. From top to bottom, 
we show a) halos above 5 x lO 13 ft. _1 Af0 in Euclid, b) halos 
detected by Erosita with the selection function of the right 
panel in Fig. I J. 11 c), halos above 5 x 10 14 /i _1 M Q in either 
Erosita or Euclid, and d) halos detected by Planck with the 
selection function of the middle panel in Fig. IJ.ll 



Fig. 32. The mean correlation, (^), integrated over < 
z < 1, as in Fig. [29] From top to bottom, we show a) halos 
above 5 x 1O 14 /i _1 M0 in either Erosita or Euclid, b) halos 
detected by Planck with the selection function of the middle 
panel in Fig. IJ.ll c) halos above 5 x 10 13 /i _1 M Q in Euclid, 
and d) halos detected by Erosita with the selection function 
of the right panel in Fig. IJ.ll 
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Fig. 33. The mean correlation, (^), over the redshift inter- 
val 1 < z < 2, for the clusters detected by Erosita (upper 
curve, with ten distance bins) and Euclid (lower curve, with 
twenty distance bins). 

Following PLANCK, space missions will map the en- 
tire sky in the X-ray (EROSITA) and optical (EUCLID) 
wavebands at unprecedented depth and angular resolution. 
Corresponding selection functions are still at the tentative 
or predictive level. It is nevertheless instructive to compare 
estimates of the statistical significance of the all-sky clus- 
ter catalogs expected from these forthcoming surveyfl In 

8 For all the considered surveys, the Mu m (z) curves were es- 
timated using specific assumptions as to the evolution of the 
X-ray, optical, and S-Z properties of the clusters, hence on 
their detectability. Moreover, the assumed cosmology was either 
WMAP5 or WMAP7, to be consistent with published analysis 
of each survey. Therefore, these hypotheses may not be totally 
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practice, the total angular area of such surveys is not really 
4-7T sterad since we must remove the galactic plane. In the 
following, for Planck we consider the two-sided cone of an- 
gle 9 S = 75 deg (i.e., |6| > 15 deg), which yields a total area 
Aft ~ 30576 deg 2 . For Erosita and Euclid we take 6 S = 59 
deg (i.e., \b\ > 31 deg), which corresponds to a total area 
that is about one-half of the full sky, Aft ~ 20000 deg 2 . 

- Planck operates at nine frequencies, enabling an effi- 
cient detection of the cluster S-Z signature but has a rather 
large PSF (5'-10'). Some 1625 massive clusters out to z = 1 
are expecte d over the whole sk y. We assume the selection 
function by iMelin et al.l ([2006) , shown in middle panel of 
Fig. EJ 

- For Erosita, a simple flux limit is currently assumed 
as an average over the w hole sky: 4 10~ 14 er gs -1 cm~ 2 in 
the [0.5 - 2] keV band (jPredehl et all 120091) . The associ- 
ated selection function is shown in the right hand panel of 
Fig. IJ.ll This would yield 71,907 clusters out to z = 1. 

- For Euclid, we follow the prescription of the Euclid 
Science Book for the cluster optical selection function 
and adopt a fixed mass threshold of 5 x 10 13 /i -1 M© 
(|Refregier et al.ll2010t ). 

We show in Fig. [21] the angular number densities per 
redshift bin. The error bars contain the shot-noise contribu- 
tion (|2"2"1) . as well as the sample- variance contribution (|4"Tj) 
that holds for any angular window and does not rely on the 
flat-sky and Limber's approximations^- The 3D correlation 
functions are shown in Figs. I3"2l and 1551 

As compared with the smaller surveys of Sect. I6.ll these 
(almost) full-sky surveys provide much more accurate mea- 
sures of the evolution with redshift of cluster abundance, 
and of two-point correlation functions, thanks to the greater 
number of objects. In particular, thanks to its lower mass 
threshold, Euclid can probe higher redshifts, both for num- 
ber counts and correlation functions. Although we have 
only considered two redshift bins for the two-point correla- 
tion function, < z < 1 and 1 < z < 2, Figs. l32l and l33l 
suggest that for Euclid it should be possible to introduce a 
smaller redshift binning, such as Az = 0.5. We leave it to 
future works to estimate which redshift binning is the most 
efficient at constraining cosmology. 



6.3. Shot noise versus sample variance 

We show in Fig. [34] the ratio of the shot-noise to sample- 
variance contributions to the covariance of number counts, 

where the rms contributions cr^ and a^'^ are defined 
by Eq. (|2"9"]) . As expected, shot noise becomes increasingly 
dominant at higher redshift, as the number of clusters de- 
creases, and it is smaller for Euclid which has a wider sky 
coverage and a lower mass threshold. 

We show in Fig. [35] the ratio /cr^ 3+4 ' ) of the con- 
tribution (l64l) to the sum of contributions (1651) and (l66l). 



self-consistent with respect to each other, but the main results 
of the comparison between the expected signals should remain 
valid. 

9 Because we consider a symmetric two-sided angular window 
(i.e., two cones of angle 8 3 around the north and south galactic 
poles), the coefficients vanish for nonzero m and for odd 

i. For even £, they are still given by Eqs. (l44|) - (l45p where we 
substitute (AO) -»■ (Af2) h aif, where (Afi) ha if = (Af2)/2 is the 
area associated with a single side (so that the last expression 
(|45|) still applies). 
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Fig. 34. The ratio a^'""^ /a^ v of the rms shot-noise con- 



tribution a N . to the rms sample-variance contribution 

a^' v '\ of the covariance of the angular number densities 
Ni obtained for various surveys. (For DES and Euclid we 
only consider the case M > 5 x 10 13 h~ 1 M !3 .) 
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Fig. 35. The ratio /cr^ 3+4 - ) of the rms contributions 

VC^ and VC( 3 ) + C( 4 ) of the covariance matrix of the 
estimator £ 2 LS . This is a measure of shot- noise effects. 
(For DES and Euclid we only consider the case M > 
5 x lO 13 ft- 1 M .) 



to the rms error <r^ = y/Ci.i. In contrast to Fig. [T?] we 
include the high-order terms of and C^ 4 \ but the ra- 
tio I a^ +4 ^ is again a measure of shot-noise effects. As 

expected, we can see that the contribution C'^ becomes in- 
creasingly dominant for smaller radial bins since they con- 
tain fewer clusters. We can see that the ordering between 
the various surveys is not the same as the one obtained in 
Fig. [5U for the number counts. This is because the mass 
thresholds are not the same (and couplings between shot- 
noise and sample- variance effects in the covariance matrix 
of halo correlations make the analysis less direct). Since a 
higher mass means both a larger correlation function and 
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Fig. 36. The ratio cr^ +C+,,) /^f of the rms high-order 
contribution (|75l) - (]77[) to the rms low-order contribution 
(second term in Eq.(|69p) of the sample variance of the corre- 
lation £j obtained for various surveys. (For DES and Euclid 
we only consider the case M > 5 x 10 13 Ii~ 1 Mq.) 



larger discreteness effects (because halos are rarer) , it is not 
always obvious a priori how the relative importance of shot- 
noise effects changes from one configuration to another. 

Here we must recall that Fig.[35]only shows the diagonal 
part of the covariance matrix Ci j and that off-diagonal 
terms can be non- negligible, see Sect. |H 

6.4. High-order and low-order contributions to the sample 
variance of ^ 

We show in Fig. (3B] the ratio of the high-order contribu- 
tions to the low-order contribution of the sample variance 
of the 3D correlation £j. We consider the Landy & Szalay 
estimator and we define along the diagonal the rms contri- 



buttons as CT Jf ?+c+ " ) 



a 



LS(«) 



^LS(C) , ^LS(r,) 



Eas.([75 ]l -([77 |l for the high-order term, and a^f = yJC, 



from 



-,LS(£) 



for the low-order term, where is given by the second 

term in Eq. (jrJ9"]) . We do not consider here the shot-noise 

f 21 

contribution <r|. associated with the first term in Eg. (1691) . 
which was studied in Fig. [35] however, while Cf^ and 
Cff^ are pure sample-variance contributions, Cf^^ and 



C 



LS(?) 



are mixed shot-noise and sample-variance contri- 



butions. Indeed, they arise from both the discreteness of 

the halo population (as shown by the power N instead of 

N , which comes from the identification of two objects as 
explained in Eq. (|D.2p ) and its large-scale correlations (as 

—3 —2 

shown by the bias factors b and b ). 

As could be expected, we can see in Fig. [35] that the 
relative importance of high-order terms increases on smaller 
scales, deeper in the nonlinear regime where correlations are 
stronger. However, in some cases there is a flattening on 
larger scales because the relative importance of high-order 
terms no longer decreases (and could even increase in the 
case of low- mass halos as seen in Fig. IT51 . This is because 



the low-order contribution C i i is actually a mixed "shot- 
noise and sample- variance" contribution, as noticed above, 
and shot-noise effects decrease on larger radii (because of 
the greater volume), as seen in Fig. [35] In agreement with 
this explanation, we can see that this upturn appears earlier 
and is greater for the surveys where shot-noise effects are 
less, that is, Erosita, Euclid, and DES. 

More generally, Fig. 1351 shows that high-order contribu- 
tions to the sample- variance or mixed terms are not negli- 
gible (but on small scales along the diagonal the covariance 
matrix is often dominated by the pure shot-noise contribu- 
tion). For the variety of cases studied in Fig. [35] they do not 
grow above five times the low-order contribution along the 
diagonal, but as shown in Figs. [T51and ll8l thcir importance 
can be greater far from the diagonal. Then, these contribu- 
tions should be taken into account if one requires accurate 
or safe estimates of signal-to-noise ratios. 

6.5. Dependence of the results on cosmology 

We investigate in App. [K] the sensitivity of our results 
to the value of the cosmological parameters, by compar- 
ing the curves obtained in the previous sections with those 
that are obtained when we change either h, f2 m , or as by 
an amo unt that corresponds to the current "2 — a" uncer- 
tainty ([Komatsu et alj|2011| ). We find that the main fea- 
tures shown in Figs. [3H [35] and l3rJl remain valid, with mod- 
est quantitative changes (e.g., shot- noise effects become 
slightly less important, with respect to sample- variance 
contributions, when as is slightly increased). Therefore, our 
results and conclusions are not sensitive to the precise value 
of the cosmological parameters (within their current range 
of uncertainty) . 



7. Conclusion 

In this paper we have presented a general formalism for 
obtaining analytical estimates of the means and covari- 
ance matrices of number counts and correlation functions, 
for distributions of cosmological objects such as clusters of 
galaxies or galaxies. To do so, we assumed that the two- 
point correlation function of these objects can be factored 
in terms of a linear bias model, and this simplifies expres- 
sions as spatial and mass (or luminosity, temperature, etc.) 
integrals factor. To estimate the high-order contributions 
to the covariance of two-point estimators, we also assumed 
that the three- and four-point correlations can be described 
by a hierarchical ansatz, that is, that they can be written 
as products of the two-point functions. This is the simplest 
model that agrees reasonably well with realistic distribu- 
tions (of the dark matter density field, as well as of cos- 
mological objects such as galaxies or clusters that follow 
the dark matter density on large scales). Although this is 
only an approximate model and it is known that actual 
cosmological fields do not exactly obey such a hierarchi- 
cal clustering, this allows us to derive explicit expressions 
that provide a reasonably good description of covariance 
matrices. 

The main differences or improvements with respect to 
previous studies are the following. 

- Keeping the application to cluster surveys in mind, 
rather than the galaxy surveys that have been the aim of 
most works, we considered two-point estimators that in- 
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volve integrations over broad redshift bins. Thus, we do not 
work with local 3D correlations within an homogeneous and 
isotropic box at a given redshift, but with averages over a 
redshift interval with explicit integration along part of the 
observational cone, where the radial direction plays a spe- 
cific role. 

- We took all shot-noise and sample-variance contri- 
butions into account, along with high-order contributions, 
which in the present case of one-point and two-point estima- 
tors involve products of two two-point correlation functions 
and the three- and four-point correlations. 

- Within the framework of the simple hierarchical model 
recalled above, we gave explicit expressions for all contri- 
butions to these means and covariance matrices. They can 
be readily used for any population of objects and any set 
of cosmological parameters, provided one is able to com- 
pute the mass function (or the luminosity/temperature 
function), the two-point correlation and three- and four- 
point normalization parameters. In practice, assuming a 
linear scale-independent bias model (or a uniform scale- 
dependence that can be absorbed into the two-point corre- 
lation), it is sufficient to give a bias b(A / I,z) in addition to 
the mass function. 

We first studied the number counts per redshift bins, 
comparing the relative importance of shot-noise and sam- 
ple variance contributions and giving scaling laws obeyed 
by the signal-to-noise ratios, as a function of the survey 
area and the number of fields. We have explicitly consid- 
ered the case of large angular windows, and estimated the 
angular scale where the flat-sky and Limber's approxima- 
tions break down, which occurs at about 10 deg. We also 
computed the decay of correlations between distant red- 
shift bins. In particular, we checked that a redshift bin- 
ning of width Az = 0.1 is broad enough to neglect cross- 
correlations between different bins. 

Next, we studied estimators of the 3D correlation func- 
tion, averaged over finite redshift intervals. We compared 
the Peebles & Hauser estimator with the usual Landy & 
Szalay estimator, and we evaluated the relative impor- 
tance of shot-noise and sample- variance, low-order and 
high-order, contributions to the covariance matrix. We also 
considered the behavior of the off-diagonal terms, and de- 
scribed how high-order contributions make the covariance 
matrices less diagonal as correlations develop between dif- 
ferent scales (especially as one of the scales becomes smaller 
and more nonlinear). Then, we performed the same analysis 
for the 2D angular correlation function. 

Throughout we compared our analytical expressions 
with results from numerical simulations and we obtain a 
reasonably good match. This makes such analytical results 
more competitive than simulations, because they are much 
faster to compute and allow one to describe rare objects 
that would have low-quality statistics in the simulations. 
Finally, we applied our formalism to several future clus- 
ter surveys, and considered both limited-area and full-sky 
missions. 

We hope our results can help for estimating the signal- 
to-noise ratio of current and future surveys. This is useful 
for comparing the efficiency of different probes and different 
survey configurations, such as the choice of redshift binning, 
survey area, or number of subfields. 

Our study should be extended in several directions. 
First, it would be interesting to consider the noise associ- 
ated with photometric redshifts. Second, one should include 



the effect of rcdshift-space distortions, which are likely to 
be important on small scales. Third, the computation of the 
means and covariance matrices studied in this paper is only 
an intermediate tool for comparing theoretical predictions 
with observations, and the final goal is to derive constraints 
on cosmological parameters or astrophysical processes (e.g., 
scaling laws for cluster mass-luminosity-temperature rela- 
tionships). Our results may be used to further investigate 
the cosmological information that can be extracted from 
cluster surveys and to optimize observing configurations so 
as to improve those constraints. We leave these tasks to 
future works. 
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Appendix A: Mean and covariance of number 
counts 

To avoid introducing numerous Dirac factors, owing to the discrete- 
ness of the observed distributio n of ob j ects, w e follow the simple ap- 
proach described in Sect. 36 of Peebles ( 1983) to compute the statis- 
tical properties of counts in cells. We illustrate in this section this 
method for the computation of the mean and covariance of number 
counts within redshift bins. 

We divide the "volume" of the space (z, CI, In M), which enters 
the expression 1171 of the angular number density of observed objects 
in redshift bin i, over Af small (infinitesimal) cells labeled by the index 
a, so that Eq. 1171 1 reads as 



(An) 



(A.l) 



where subscript i refers to the redshift bin i. Then, since the cell on is 
infinitesimally small it contains at most one object, whence llPeebled 
H980h 



fi>cti = or 1, and h a . =h ai . 

Moreover, by definition its average is given by 



dM dn 



{h a .) = dz—Vdil 

x *' dz M dlnM 



(A.2) 



(A.3) 



Of course, we recover for the mean number of objects in the redshift 
bin i the expression 11911 . which could also be read from Eg. 11711 using 
the average 



dn 



d In M din AI 



(A.4) 



We now consider the covariance of the angular number densities 
Ni. From Eq.lTSjJ we have 



hj/^aj) + ^ {h ai h aj ) 
Si, j i/J(n ai ) + (h ai h aj ). 



(A.5) 
(A.6) 
(A.7) 



In the second line we used the fact that the redshift bins do not 
overlap, so that for two "volumes" on and ctj to coincide, bins i and 
j must be the same (and 5ij is the Kronecker symbol), while in the 
third line we used Eq. lA.2l l. The first term in Eq. lA.7l l corresponds 
to the shot noise, due to the discreteness of the object distribution. 
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distance r. We denote F[£) as the fraction of this sphere that is en- 
closed within the observational cone. In our computations elsewhere 
we used the approximation F = 1, but for R s — r<£<R s we actually 
have F < 1. As in the transverse view shown in the left hand plot of 
Fig. IB.ll the angle 9i associated with the farthest point of intersection 
between the cylinder and the sphere satisfies I + r sin 0£ = R, whence 



Rs~r<£<R s , 0<9 e <- 



R s 



sin t>£ 



(B.l) 



Fig. B.l. Geometrical illustration of finite-size effects. 
Close to the survey boundary, part of the sphere of radius 
r extends beyond the observational cone and should not be 
counted. The left plot is a transverse view, orthogonal to 
the central line of sight, whereas the right plot is a view 
from a point far away on the line of sight. 



The second term includes the nonzero-distance correlation between 
objects, and reads as (for cti ^ ctj) 



(n ai n aj ) = (n az )(n aj ) [l+CL 



(A.8) 



where £a iiQj is the "halo" two-point correlation function between 
"volumes" cti and ctj, see IPeebled l fl98Cl 'l. This yields 



(NiNj) 



(Ni) 



df2; dMj dn 



' " * J (AQ) ' /. ~* 1 (AO.) Mi dhiM; 



{AQ) Mj dlnMj 



(A.9) 



using obvious notations where we label the quantities associated with 
Ni and Nj by the subscripts i and j and we integrate over the bins i 
and j. This could also be directly obtained from Eg. (I17H by writing 



dn dn 
'din Mi din Mj' 

M, 



dn 



dn 



din dlnMw 



S D (Xj ~ Xi)Sn (fij - toi)Sn (Mj - M, ) 



+ 

dn 



din Mi 



(A.10) 



where the second term with the Dirac factors gives the shot-noise 
contribution. 

In this derivation we have assumed in Ea JA.2l l that space can 
be divided into infinitesimal volumes that contain either zero or one 
object and that each object only appears in one cell. Even though 
clusters and dark matter halos are actually extended objects, it is still 
possible to define a point distribution by associating a single point to 
each cluster or halo, f or instan c e the halo mass center. Thus, this ap- 
proach, which follows Peebles (f980), applies to these cases as well 
and to any distribution of discrete objects, as long as we restrict our- 
selves to count distributions and do not study the internal structure 
of these objects. 



Appendix B: Finite-size effects 

As noticed in Sect. [4] in our computations of the mean and covariance 
of the estimators § and £ LS we neglect finite-size effects. Indeed, we 
do not take the fact into account that when a point i gets close to 
the survey boundaries the available space for points i' located in the 
distance bin [Ri _ , fl< +], with respect to point i, is only a fraction of 
this spherical shell since a part of it extends beyond the observational 
cone. This means that we overestimate the total number of pairs. This 
has no impact on the mean, since this effect cancels out between 
the numerator and denominator in 1148 II . but it means that we slightly 
overestimate the signal-to-noise ratio. 

To estimate the magnitude of this error, we compute the geomet- 
rical factor illustrated in Fig. IB.f I Approximating the observational 
cone as a cylinder of radius R s = T>9 3 , a point i at distance i from the 
central line of sight is the center of a spherical shell of radius r, onto 
which we count all neighbors i' to estimate the correlation £ at this 



Next, in the plane of each vertical section (i.e., at fixed 9), shown 
in the right hand plot of Fig. IB.fl that corresponds to a projection 
along the line of sight, the cylinder appears as a circle of radius R s , 
whereas the section of the sphere of center i appears as a circle of 
radius rsin(0). Both circles intersect (again for R s — r < £ < R s ) at 
the symmetric polar angles tp±, with 



' 9 + 2£r sin 9 sin tp± . 



(B.2) 



Then, the surface of the sphere that extends outside of the observa- 
tional cylinder writes as 



4r' 



71-/2 



7I-/2 

de sine I dip. 



(B.3) 



Thus, for R s — r < £ < R B the fraction of the sphere that is enclosed 
within the observational cylinder reads as 



F(£) = 1 



cos 9 1 



. / R 2 s -£ 2 -r 2 (l-x 2 ) 

— Arcsin — ■ — - 

7T y 2£r vT - x 2 



(B.4) 

whereas F(t) = 1 for < £ < R s — r. Then, integrating the position of 
the central point i over the cylinder, the fraction of volume for pairs 
at distance r, with respect to the approximation F = 1, writes as 



N J Q R a Rs 



--F«). 

Rs Rs 



(B.5) 
(B.6) 



This gives the ratio of the number of pairs N' , which is measured in 
the survey, to the number TV obtained when we do not take finite- 
size effects into account. For instance, at z = 1, which corresponds 
to the angular distance T> ~ 2352/i _1 Mpc, and for a survey angular 
window of area 50 deg 2 , which corresponds to 9 S ~ 0.0696 rad, we 
have R 3 = V9 S ~ f64fe- 1 Mpc. Then, we obtain TV'/JV ~ 0.91 for a 
shell at radius r = 30h~ 1 Mpc. This means that the approximation 
F = 1 overestimates the number of pairs by about 10% and the signal- 
to- noise ratio by 5%. 



Appendix C: Computation of the mean of the 
estimators £ and £ LS 

Defining the 3D Fourier-space top-hat as 

W 3 (kR) = e ik r = 3 Sin(fci?) -^ COS(fc/?) , (C.l) 



J 4ttR 3 /3 (kR) 3 
the 3D Fourier-space window of the i-shell reads as 
d£ eik . r 

Rl + W 3 (kRi, + ) - Rl_W 3 {kRi,-) 

R l+ ~ ' 

where the superscript (3) recalls that we consider a 3D radial bin. 
Then, writing the two-point correlation function in terms of the power 
spectrum, as in Eq.j3]l, we obtain for its radial average I I54H 



(C.2) 



dk 



— A*(k,z)w£ 3 \k). 



(C.3) 



(Here i and i' refer to the same radial bin; the prime only recalls that 
we are integrating over a neighbor i' within a small radial shell with 
respect to another point in the observational cone.) 
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Appendix D: Derivation of the covariance of the 
Peebles & Hauser estimator £ 

We compute here the covariance of the estimators £j , which is identical 
to the covariance of the quantities To simplify the expressions 

we do not consider mass binning here, but it is straightforward to 
generalize to the case of several mass bins. From the definition H48H 
we can write with obvious notations the second moment as 



((1 + &)(!+&)> 



1 



dfii dM; 



(AH) Mi 



dr. 



1 / 9 dfi, dM, 



x{ 



dh 



dh dh 



dr 3 



dh 



din Mi din dlnM,- dlnMj 



(D.l) 



The average in Eg. IIP. ill can be written as in Eg JA.lOII . with many 
Dirac factors for the shot-noise contributions. However, as in App. [X] 
it may be easier to follow Peebles (1980) and to divide "volumes" 
over small (infinitesimal) cells that contain h objects, with h = or 
1. Then, we can split the average (hihi/hjhji) as 

(hihi/hjhj,) = (nih l thjhji)^ s ' v '' + S it j(hih i ihji)( s - v -' > 
+&i ai (hih i ,hj)( s ' v '' > + Si> j (fnhiihji) ' s,v '' 
+8 i , )J ,(hih i ,h j y s - v ^ +8 i , j 6 i , J ,(n i n i ,)( s - v -'> 



+5 itJ ,6 t , tJ {hihi,)( s - v -\ 



(D.2) 



where we have explicitly written the first "pure sample-variance" con- 
tribution and the last six "shot-noise" contributions associated with 
the Kronecker symbols. The remaining averages with the superscript 
"(s.v.)" denote "sample-variance" averages, that is, without further 
shot-noise terms. Here we used the fact that the objects i and i' 
are separated by the finite distance ry, with fy > -R;,— , so that 
the elementary "cells" i and i' cannot coincide and there is no shot- 
noise contribution of the form 5; j/ . For the same reason there is no 
term Sjji. Ne xt, the "sample- variance" averages of Eq. llD.2t read as 
JPeebledll98Ch 

(hihimjhf)^ = (ftiXAi'XftjXV) [! + &' + 



1 -3 * ,3 '»),)' ,3,3 ,3 ,3 

**>*'*i>i' '^i,3^i',3' '^i,j'^i',3 ^i,i',3,3'} ' 



(D.3) 

(D.4) 
(D.5) 



where £ h , f h , and r; h are the two-point, three-point, and four-point 
correlation functions of the objects. Since we have 



mi) - = <(i+&)(i+&)> - <i+^xi+«i>, 



CD-' 



we obtain from Eqs JD.ll l-l lD.5ll the decomposition (I59H of the covari- 
ance matrix, with the explicit expressions (1601) - (162 ft of the various 
"sample-variance" and "shot-noise" contributions. Here we used the 
svmmetriej^l {i o i'} and {j o j'} of Eq. IIP. It . In Eq. lRHl the 
object "j"' is at the distance ry from the object "i", since this shot- 
noise contribution comes from the case where the objects i and j are 
the same object (or from one of the three remaining cases "i = j'" , 
"i' = f\ or "i' = j'"). The shot-noise contribution (160 H comes from 
the identification "i = j and i' = j"' (or "i = j' and %' = j"). This 



10 The integration over the points i and i' in Eq. (|D.l|l should be 
understood as J dxiX>fdfij J dXs'X^/dfij/ @(Ri- < |xj— Xj/| < 
•Ri,+), which is more clearly symmetric, where O is a top-hat 
window that takes values or 1 with obvious notations. In prac- 
tice, we actually slightly "break" this symmetry by using the 
variables (xt, £1%', iy ) as in Eq. (|D.l| ) if we do not take boundary 
effects into account, as in this paper. 



implies that the distances ry and ry are equal, which gives rise to the 
Kronecker symbol Sij since we consider the case of nonoverlapping 
distance bins [Ri _ , B4 +] . 



From Eo J52H the contribution of E0 J6OI I also reads as 



1.1 ( 'J 



(AQ)Qi 



(D.7) 



In order to estimate the contributions C^ 3 ' and c' 4 ' we assume that 
the radial bins , Ri,+] are restricted to large enough scales to ne- 

glect three- and four-point correlation functions, as well as products 
such as fi-yfj'-j*. Thus, we only keep in this Appendix the contribu- 
tions that are constant or linear over the two-point correlation func- 
tion £i ;J of the objects, which we recall with the superscripts "1" and 
below. Moreover, we again assume that the two-point correlation 
function can be factored in as in Eq. JTJ. 

The first contribution to Cj) , associated with the factor 1 in the 



brackets in E0.I I6II I. reads as 



C 



(3,1) 



d x V 2 n i V i V j . 



The contributions that are linear over £ sum up as 



(D.8) 



C 



(3,0 



(An)QiQj 



d X 'D 2 b 2 n 3 V l V J 



e( r ) 1 t( r ) 1 t( r ) 
V Sy 



(D.9) 



where Q, and J , are defined as in Eos. II 541 1 and JC3t ■ whereas Ty 



is defined in Eg. (I68H and also writes as 
dk 



A' 2 (k,z)wj 3 \k)W^'(k) 



(D.10) 



Next, at this order the contribution (162 1 to the covariance simplifies 



C 



(4,0 



d x V 5 b n^ViV^cyi, 



(D.H) 



where ^ cy i is Limber's approximation (I13H to Eq. (Sj) • Then, collecting 
all terms, we obtain the expression H67H for the covariance . 



Appendix E: Derivation of the mean and 

covariance of the Landy & Szalay estimator £ LS 

We can relate the Landy & Szalay estimator £ LS defined by Eq. H56H 
to the Peebles & Hauser estimator 1148 II by 



4 LS = 6 - 2^ 

where we defined the cross-term by 



(E.l) 



i+i c 



dx 2 do dM 
dz — 23 

dz (Aft) M 

dh dn 



dr' 



dM' 
M' 



dlnM dlnM' 
We obtain at once, using Eos. llA.4ll and (1501) . 

<£> = 0, 



(E.2) 



(E.3) 



which leads to Eq. (I57tl . 

From the relation (IE. Il l we have for the covariance of the estimator 



(E.4) 
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where Cj w is the covariance of the Peebles & Hauser estimator £, 
defined in Eq. J59}. To compute the cross-terms in HE,4| | we write as 
in Eo. luTTIl . 



1 



((1 + £;)(!+£■)> = 7T / A ^ V i 



(An) Mi 



dr Ai - 



dM,, 



1 , dZlj dM, 

X Q-/ d ^(A?f) Mf 
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dn 
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d In Mi d In M v d In Mj d In M 



Proceeding as in App. [D] this gives 



c: 



with 

2 



(4) 



c 



; (3) 



(An)0 4 C 



2 dn, dM, / dM 
dXi T> i v^rr / dr 



, dM,/ dn 
X / dr 
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3 My dlnM, dlnM 4 , dlnM,-, 



1 M V 



(E.5) 



(E.I 



(E.7) 



Appendix F: Computation of high-order terms for 
the covariance of £ LS 

We compute here the high-order terms for the covariance C LS of the 

Landy-Szalay estimator fP" that we had neglected in Eq. H69ll , 

For numerical computations, it is often more efficient to express 
the quantities that we encounter in this work in terms of the real-space 
correlation ^(x), instead of the Fourier-space power spectra P(k) or 
A 2 (fc), provided £(x) is known (e.g., computed in advance on a fine 
gri ef"!) . Indeed, this replaces oscillatory integrals by integrals with 
slowly-varying factors, which allows faster and more accurate compu- 
tations. This comes from mostly considering various kinds of volume 
averages of correlation functions, such as Eq. (f5J , which are more nat- 
urally written in configuration space. This yields integrations over 
bounded or unbounded domains with typically positive and slowly- 
varying kernels. In contrast, the transformation to Fourier space yields 
highly oscillatory kernels as soon as some underlying real-space vol- 
umes are finite with a size much larger than some other scales (see for 
instance the 2D top-hat il'2t for a window 9 a that is much broader 
than the typical angular scale l/(k±T>)). On the other hand, interme- 
diate analytical computations are often easier to perform in Fourier 
space, mostly because of the convolution theorem. Then, a convenient 
method is to first write expressions in terms of Fourier-space power 
spectra, perform integrations over angles, and finally go back to the 
real-space correlation function, using the fact that from Eq.j3]l, £(x) 
and A 2 (k) are related by 
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Next, to compute the last term in Eq. (IE.4I I we write 



((l + S)(l + 3)> 
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(E.I 
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dk . n , , sin(fc:r) 
A 2 (k) — ^—L, 
k kx 



dx 



A 2 (k) = - / — i(x)(kxy sin(fcc). 



(F.l) 



(F.2) 



As shown below, this method also allows partial factorization of most 
integrals. 

A first high-order contribution to the covariance C^j arises from 
the product .j in Eq, ]66l l, which also writes as Eq. (1751 1 where 

(r) (r) 

we introduced the quantity £• defined by 

^,3 1 ',3 



d Xj 



dflidflj 

(An) 2 



dr., dr. 



(F.3) 



Expressing the two-point correlation functions in terms of the power 
spectrum, using the flat-sky (small angle) approximation, as well as 
Limber's approximation as we did for Eq. we obtain after integra- 
tion over angles and over the two radial shells, 



whence 
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Collecting all terms in Eq. IIE.4ll . which reads as C^? = Cij — 2C, C . — 
2C| i +4C^ , we obtain the decomposition 1163 II with the contributions 



(r) (r) _ 2tt 

^i\j'^i'\3 X> 



dkidk 2 P(ki)P(k 2 )S D (k n +fc 2 ||) 



x wf J \k 1 )w j (3) (k 2 )w 2 (\k 1± + k 2± \ve s ) 2 . 



(FA) 



Again, the factor 27r<5£>(fc 1 || +fe2||) comes from the integration over 
Xj, which suppresses longitudinal wavelengths. Using the exponential 
representation of Dirac functions, Eq JF,4|I can be partially factorized 
as 



tWtW 



dr 



D(2tt) s 



dkidk 2 P(k 1 )P(k 2 ) #f '(fci) wj 3) (k 2 ) 



c Jdk ± W 2 (k ± V8 a ) 2 e ir r ( - k i\\+ k 2\0+ ir ±< k i±+ k 2±- k ±\ (F. 



5) 



and the integration over angles yields 
dr f dfei A 2 (fci) 



ki Vk L 



sin(ifcir)Wi (3) (fci) 



11 In practice, we compute in advance £(x, z) on a 2D grid, over 
distance and r edshift, u sing Eq. (IF.l|l and the nonlinear power 



spectrum from [Smith et aLl (|2005fi . To obtain meaningful and 
accurate results, one needs to make sure that is accurately 
computed, especially on large scales where one should recover 
linear theory and a smooth two-point correlation. 
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dk 2 A 2 {k 2 ) . 



sin(fc 2 r)iy (3) (fc 2 ) 
k 2 Vk 2 J 



dk 



-Vksin(kr)W 2 (kVd s f 



(F.6) 



This also reads as 



= 29. 



{ve s y 

where we introduced 



f Z C3) (r) x (3) (r) A (3) (JL\ (F .7) 



and 



du sin(yu) W 2 (u) 2 , 



a2,,^UM» t - /(3) 



H^ w (fc). 



(F.; 



(F.9) 



The function A® (y) can be written 



0<y<2:A( 3 )(y) 
y>2:A< 3 >(y) 



— [3ny-2(4 + y 2 )E(y/2) 
Sir 1 

-2(-4 + 3/ 2 )K(j//2)] , 



j£ [37r-(4 + j, 2 )E(2/j,) 



3?r 



+(-4 + y 2 )K(2/y)] 



(F.10) 



(F.ll) 



where K(fc) and E(fc) are the complete ellipti c integrals of the first 
and second kinds (Gradshtcyn & Ryzhik 1965). One can check that 
A' 3 ' (y) is a positive, nonoscillatory, and continuous function (but not 
analytic at y = 2), with A^(y) ~ 2j/ for y — > and A( 3 )(y) ~ 
for j/ — > oo. 

Going back to configuration space, by substituting Eq. <F.2t , the 
integral (I F . 9 1) can be written as 



with 
W 3 (a,b) 



dx x£(x) 
r{Rl + -RU 



Ri. 



Ri, 



-R?_W 3 



Ri,- Ri, 



du sin(cm) sin(6ji) W 3 (u) , 



(F.12) 



(F.13) 



which for a > and b > is given by 



where f was defined in Eg. II 541 1, because the integrations over ry 
and Ty are independent. The second term reads as 



At) Ar) 



dkidka P(fe 1 )P(fe 2 ) Wf'dk! + k 2 |) 



xwf\k 2 ), 



(F.17) 



which no longer factors. Introducing an auxiliary wavenumber k and 
the Dirac factor (5j)(ki +k 2 — k), which we write under its exponential 
form as in Eq. |fF3t . and using the inverse Fourier transform of the 
3D shell EH, 



8(r g Vj) 
Vi 



dk 

(2tt) 3 



ikr ^ 3) (fc) 



1 / dk k3 sHkr)_^ 3) 
2tt 2 / k kr 1 K " 



as well as Ea. lfF~9l l. we obtain 
f ^^ r) X^(r). 



(F.18) 

(F.19) 
(F.20) 



The third term in Eq. (IF. 15ft is obtained from Eq. HF.20II by exchanging 
the labels "j" and u f . 

We now turn to the four-point contribution to Eq. (I66H - using 
Eq. (O for the halo four-point correlation function rjr , . .,. Thanks 

to the symmetries {i -o- i'} and {j -O- j'} we have two different contri- 
butions (a) and (b) associated with the topology of the left diagram 
in Fig. [2] each with a multiplicity factor 2, and four different contri- 
butions (c), (d), (e), and (f), associated with the topology of the right 
diagram, with multiplicity factors 4, 4, 2, and 2. 
The first contribution (a) reads as 



C 



LS(4,a) 
' i,j 



(F.21) 



with 



,(r) t Ar) / dXi / dnjdgj j diydiy 



Proceeding as for Eq. lF.3l . we obtain 
dr r 



(F.22) 



(F.23) 



\a-b\ > 1 
|o — 6| < 1, a + 6 < 1 
jo - b\ < 1, a + b > 1 



W 3 = 
W 3 = 3ab 



W 3 



Hl-(a-b) 2 ]. 



(F.14) 



Thus, using Ea.l lF~T2l . the quantity of Eq. JFTfll involves 

slowly varying integrals over real-space variables, which partially fac- 
tor as three factors within the integrand of Eq. llF.7ll . This makes it 
more efficient to use Eq. J F . Til than the Fourier-space expressions IF. Il l 
or jFlH l. 

To evaluate the two remaining contributions, associated with the 
factors Ca> j' m Eq. 1651 and Vi,i';j,j' i n Eq. 1661 . we use the model for 
the three- and four-point correlation functions described in Sect. l2.L2l 
Thus, using Eq.@ for the three-point correlation function that enters 
Eq J65II , this contribution to Eq J65l l reads as 



LS(3,C) 



{AQ)QiQj 



The first term in the bracket in Eq. flF.15l) is given by 



(F.15) 



(F.16) 



The contribution C^' 4,b ' is the symmetric one with respect to {i <-» 
j} of Eq. (IF.21I I; that is, the product Q, r ' X (.i-jS,^, is replaced by 

Next, the contribution C LS reads as 



QiQj 



(F.24) 



where the geometrical average writes as 



(F.25) 



since integrals over r^/ and ry can be factored. 

(r) (r) 

The contribution (d) involves f •. •/ where no factorization 
is possible. Proceeding as for Eq. HF.3l l we obtain 

dr r 



f M ( , .tW 



{ve s y 



i{r)xf\r)xf\r) [■ 



r 

ve. 



(F.26) 
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The contribution (e) involves ^r?-£—/£v?- tnat can be written as Eq.llFJjJI, we obtain after integration over angles 

3 ^j 1, 1 'i3 

s j':i s 2,i' s i':.l s / Tia \2 \ -rifl / 3 ' ' 



(ve s ) 2 \vej i w ' n 

R '.+ dr'r' 2 ^ /' r+r ' dr"r" f( X j Ak 1 k 1 P(k v ,z t )W { j 2) (k 1 V,) f dk 2 k 2 P{k 2 ;z j ) 

?v ) / ; s( r )i (F.27J 

r r' 



I! ' . /. , ! , 



whereas contribution (f ) is obtained from (e) by exchanging the labels 

V and "j". r 

Collecting all terms, the high-order contributions to the covari- X / dx x Jo{xy) Wi (xB s ) ■ (H.3) 

ance matrix are given by Eqs. l75t - 177> . J 

Then, after a rescaling of variables x and y, and defining the quantities 

Appendix G: Computation of the mean of the ,„> f°° „ 

. „ , , T q A( 2 %)= / duu J (y«)H/ 2 («) 2 , (H.4) 

estimators w and w Lb w J v ; v ; ^ ; 

We give here explicit expressions of the average 1841 of the correlation . 

function over an angular ring. As in Sect. [2TT731 using the flat-sky and B^ 2 \d) = / dxT>^ b 2 n 2 X^ 2 \d) (H 5) 

Limber's approximations, we obtain J % 

^W-r?^^ (G ' 1} 1?\0)= t^^MkV e) W?\kV), (H.6) 



k Vk • A > l "'"J III 

where we introduced the 2D Fourier-space window of the i-ring, . . , . -™ . , . ,™, . „ . 

we obtain the expression 11961 1. using the property AS '(y) = tor 

n y > 2. As compared with Eq. tH.2t . introducing the Dirac factor and 

W^ 2 \kj_T>) = / gikx'f 6 the two auxiliary variables x and y has allowed us to partly factor 

1 J A.% in the integrals, as seen in Eq. l96t . which is convenient for numerical 

computations. Again, it is useful to express Eg. IIH.6I I in terms of the 

_ Qi^+WlikxDBi^) — 9 i _W2(k±T>9i t -) real-space two-point correlation function, which yields 



0? , - 8 2 



(G.2) 



T?\6) - 2 I dxi{x) 



and W-) , associated with a full circular window, was defined in Eg . (1 12ft . n J ^(^ 2 .+ — ) 

In terms of the two-point correlation function, Eq. iG.U also writes as 



— , — ^ 



dx x 



-fl?,_W a (— ,— 

\ x x / 



(H.7) 



= 92 : 02 j / ^ew^i-^fl?./* 3 with 



dx x 2 



©3 



1 _-p25)2 ^2 W2(a,6)= / du sin(u) Jo(au) W 2 {bu). (H.8) 



l-X> a fl?,/x 



Although there is no explicit expression for the integral IIH.8II for 
(G.3) arbitrary (a,b), for \a — b\ > 1 wc can use the properties 



b < a - 1 : W 2 = 

which avoids introducing oscillatory kernels. 6 > a + 1 : VF2 = 2/6 2 . l- tl,J J 



In the band \a — b\ < 1 one can check that W 2 (a, b) is positive and 

Appendix H: Computation of the covariance of w LS deca y s as ~ b ~ 2 for lar s e b ' so that the real-space expressionEHi ^ 



again more convenient than the Fourier-space expression JIT 
The low-order contribution ((94j to the covariance matrix of the esti- The second contribution {fSZJ arises from the three-point correla- 

mator w LS involves the angular average ll95l l. Using Limber's approx- tion f in Eq.(|9TJ. Using Eq.© it reads as 
imation it also reads as 

r°o a r LS(3,C) _ 4 f, -8T3 3S3 



We now compute the high-order terms of the covariance Cj^. 



(H.10) 



which are given in Eqs, ligifl l-l9"gll. A first contribution JSHJl arises from where the three terms in the brackets, which correspond to the three 
the product .j in Eq. lt92ll . Using Limber's approximation and diagrams in Fig. [T] are again geometrical averages along the lines of 

integrating over angles yields sight, which we compute with Limber's approximation. In particular, 

the first term factors as 

^s (4l «) = 2^ r d ^ vj - 2 _a r d% ^ -2 _ 2 = 

X J dk 1± dk 2± P(k 1± ;Zi)P(k 2± ;z 3 ) W^ 2) {k 21 _V 3 ) where £ (B) wag defined in Eq .jg|j ) while the secon d term reads as 



Introducing a Dirac factor jk2x — fjkix -xi), which we write 



H',iH',j' 252 



~ (21 - f21 

with the usual exponential representation in a fashion similar to X W- (|ku_ + kgjJX') W) (k 2 j_T>). (H.12) 
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With the same factorization method, and using the inverse Fourier 
transform of the 2D shell llG~2ll . 



e(6> g A) 
A 



wc obtain 



dk ± D 2 

V 2 f dk 
17 



k 2 J (kV9)W^\kV), 



,+ 



where wc introduced 



20 



da 



u 2 VT- 



(?) 



(H.13) 
(H.14) 

(H.15) 

(H.16) 
(H.17) 



The third term in Eq JH.lOI I is obtained from Eq.l lH.15t by exchanging 
the labels "j" and "j" . 

The third contribution (I98D arises from the four-point correlation 
r\ in Eq, ll92l) , As in App. [F] we must compute the various terms as- 
sociated with the diagrams of Fig. [2] with contributions (a) and (b) 
associated with the left diagram and contributions (c), (d), (e), and 
(f) associated with the right diagram. The first contribution (a) leads 
to 



C 



LS(4,a) 



N 



dxv^t^^eu.^i 

16 M J 



As in App. [F] this geometrical average factors as 



with 



£■ £ (6) 



V 2 



x W 2 (|ki 

2 



dk lx dk 2± p(fc 1± )P(fc 2± )iyf ) (fc 1± ©) 



k 2 _ L |X>9 3 ) 2 



(H.18) 



(H.19) 



(H.20) 
(H.21) 



Contribution (b) is the symmetric one of (a) with respect to i j. 
Next, contribution (c) involves the geometrical average 

£^-£i;j£^M which again factors as 




A£l=25.Nf=l.M>lel4 

- - Ail=25,Nf=2,M>lel4 
---- Ail=25,Nf=4.M>lel4 
Ail=50,Nf=l,M>lel4 

- - AQ=50, Nf=2, M>lel4 
Ail=50. Nf=4.M>lcl4 

- AQ=100, Nf=l, M>lel4 
AQ=100, Nf=2, M>lcl4 
A£l=100.Nf=4, M>lcl4 
Ail=25, Nf=l.M>5el4 

- - Ail=25,Nf=2.M>5el4 

AQ=25,Nf=4, M>5cl4 

- AQ=50, Nf=l, M>5el4 

- - Ail=50, Nf=2.M>5cl4 
---- AQ=50, Nf=4, M>5el4 

Ail=100.Nf=l,M>5cl4 

- - Ail=100.Nf=2, M>5cl4 

A£l=100.Nf=4, M>5cl4 

Ail=25,Nf=l,M>2el3 

- - AQ=25, Nf=2, M>2el3 

Ail=25.Nf=4.M>2el3 

AQ=50, Nf=l, M>2el3 

- - Ail=50,Nf=2,M>2el3 
-■- Ail=50.Nf=4,M>2cl3 

Ail=100.Nf=l,M>2el3 

- - Ail=100.Nf=2,M>2cl3 
Ail=100. Nl'=4. M>2cl3 



Fig. LI. Scaling of the number-counts signal-to-noise ra- 
tio by \J Ail as computed in the Horizon simulation, see 
Sect. 12.21 Different configurations are displayed according 
to the total surveyed area AfJ, the number of subfields 
Nf — A/", and the mass limit. In the right caption, AQ 
is expressed in deg 2 and the mass unit is /z _1 M . 




AQ=25, Nf=l, M>lel4 

AQ=25, Nf=2, M>lel4 

A£l=25.Nf=4.M>lel4 

Ail=50,Nf=l,M>lel4 

A£l=50.Nf=2.M>lel4 

Ail=50.Nf=4.M>lel4 

— Ail=100.Nf=l,M>lel4 

- - Ail=100.Nf=2, M>lcl4 

AQ=100, Nf=4, M>lel4 

— Ail=25,Nf=l.M>5el4 

- - Ail=25,Nf=2.M>5el4 

Ail=25,Nf=4.M>5el4 

A£l=50.Nf=l.M>5el4 

AQ=50, Nf=2, M>5el4 

AQ=50, Nf=4.M>5cl4 

AQ=100, Nf=l, M>5el4 

AQ=100,Nf=2, M>5el4 

A£l=100.Nf=4, M>5el4 

Ail=25,Nf=l,M>2el3 

A£l=25.Nf=2,M>2el3 

AQ=25, Nf=4, M>2el3 

AQ=50, Nf=l, M>2el3 

AQ=50, Nf=2,M>2cl3 

AQ=50, Nf=4.M>2cl3 

A£l=100.Nf=l,M>2el3 

A£l=100.Nf=2, M>2el3 

A£l=100.Nf=4, M>2el3 



AS), , 



The contribution (d) involves the average 



t(<)t, .A<>) 



T>3 

which also writes as 

20 



dUidUj 

{An) 2 



de,,d0i 



(H.22) 



(H.23) 



(H.24) 



Contribution (e) involves £ .£ ^).^?\ , which reads . 

v ' i ;i 1,1 V ',1 



do 1 



9? , - 



xecyi(e') / d^p ( cyl ( v / e 2 +e' 2 +2ee' cosl p), 



(H.25) 



whereas contribution (f ) is obtained from (e) by exchanging the labels 
"i" and "jr". 

Collecting all terms, the high-order contributions to the covari- 
ance matrix Cf'j are given by Eqs J96l l- ll98l . 



Fig. 1.2. Same as Fig. 11.11 but with a scaling that depends 
on the number of subfields: N~ n/A (Af2)( n + 2 )/ 4 , with n = 
-0.6 



Appendix I: Scaling of the number counts 
signal-to-noise in simulations 

We present here the result of scaling the number counts with the 
total surveyed area AQ and the number of subfields M ■ Figures 11.11 
and lI.2l show the scalings expected from Eq J35H in the shot-noise and 
sample- variance dominated regimes. Multiple survey configurations 
are explored by varying the total surveyed area (AO = 25, 50, and 
100 deg 2 ), the number of subfields (M = 1,2, and 4), and the mass 
threshold (M > 2 X 10 13 ,10 14 , and 5 X lO 14 fe- 1 M ). 

The weak scatter in those plots shows that (1356 provides a valid 
approximation of the signal-to-noise scaling with respect to AC and 
A/". In agreement with the discussion in Sect. [3~^TT1 and Fig. [7] at high 

redshift and for high mass, the scaling yj (AO) shown in Fig. 11.11 is 
best, as expected for the shot-noise dominated regime, whereas at low 
redshift and for low mass the scaling J\f~ n ^ (AC)^" 1 " 2 )/ 4 shown in 
Fig. 11.21 (with n = —0.6) is best, as expected for the sample-variance 
dominated regime. 



34 



P. Valageas et al.: Covariance matrices for halo number counts and correlation functions 




Fig. J.l. Left panel: cluster mass associated with a 50%, 80%, or 95% detection probability (from bottom to top), for 
the XXL selection function CI, as a function of redshift. Middle panel: minimum detectable cluster mass, as a function of 
rcdshift, for the Planck space mission. Right panel: cluster mass associated with a 50%, 80%, or 95% detection probability 
(from bottom to top), for the Erosita selection function as a function of redshift (we consider a flux limit of 4 x 10 -14 
ergs -1 cm -2 in the [0.5 — 2] keV band). 





h 






WMAP7 mean values 


0.702 


0.274 


0.816 


2 — a deviations 


0.73 


0.289 


0.864 


line style 


dashed 


dot-dashed 


dotted 



Table K.l. Three alternative cosmologies. 

Appendix J: Selection functions used for various 
surveys 

We give in Fig. IJ.ll the selection functions that we use for several 
cluster surveys investigated in Sect. [6] For Planck, the curves shown 
in the middle panel corresponds to a 100% detection probability. 

For the other surveys studied in Sect. [5] we consider simple mass 
thresholds, rather than detailed selection functions. More precisely, 
we consider halos above the two thresholds 5 X 10 13 /i — 1 Mq and 5 X 
10 14 h- 1 M e for DES and Euclid, and above 5 X 10 14 h- 1 M Q for SPT. 

Appendix K: Dependence on cosmology 

In this appendix we investigate the dependence of the results obtained 
in Sect.[6]on the value of the cosmological parameters. Thus, in addi- 
tion to the WMAP7 cosmology recalled in the first line of Table |K~T1 
which was used in Sect. [6] we also consider the three modified cos- 
mologies where one among the three parameters h, f2 m , and erg is 
changed to the values shown in the second lin e of Table | K.1 | They 
correspond to "2— a" deviations from WMAP7 (Komatsu et al. 2011) 
and describe current uncertainties. (When we vary f! m we keep a flat 
ACDM universe and we change Q^c according to Q^c = 1 — f2m.) 

Thus, we compare in Figs. IK. II IK. 21 and IK. 31 the three curves 
obtained for these three alternative cosmologies with the curve that 
was obtained in Sect. [6] for the fiducial WMAP7 cosmology. To avoid 
overcrowding the figures we only consider the all-sky surveys, Planck, 
Erosita, and Euclid. We can see that the main features of these figures 
are not modified when we consider these alternative cosmologies, so 
our results and conclusions are not sensitive to the precise value of the 
cosmological parameters. As expected, we can also check that shot- 
noise effects become less important, with respect to sample- variance 
contributions, when ag is increased. 
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